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YEASAYERS AND NAYSAYERS: 
AGREEING RESPONSE SET AS A PERSONALITY VARIABLE’ 


ARTHUR COUCH anp KENNETH KENISTON 


Harvard University 


SYCHOLOGISTS have realized for a number 

of years that various response sets have 

considerable influence on most objective 
psychological test scores (Lorge, 1937). But 
only in the past few years, inspired by Cron- 
bach’s two articles (1946, 1950), has there been 
any systematic study of this biasing effect of 
response set and its implications for question- 
naire validity. 

Many of these investigations, recently sum- 
marized by Jackson and Messick (1958), have 
dealt with agreeing response set, defined as a 
tendency to agree (or disagree) with items re- 
gardless of their content. Psychological tests 
are frequently based on Likert-type scales with 
response alternatives ranging from Strongly 
Agree to Strongly Disagree. The agreeing 
tendency is of crucial importance with response 
scales of this type, since it is intrinsically in- 
volved in the method of measurement itself. 
In recent years, there has been a growing con- 
cern about the degree to which this agreeing 
set permeates the objective measures of many 
well known personality variables. Many studies 
have investigated the influence of agreement 
tendency on the “Authoritarian” F scale 
(Bass, 1955; Christie, Havel, & Seidenberg, 
1958); on the variables on the MMPI (Fricke, 
1956; Barnes, 1956), and on vocational inter- 
ests (Roulon, 1931; Berdie, 1943; Holland, 
1958). 

A review of the literature reveals two general 
approaches to agreeing response set. From one 
viewpoint, response set is seen as primarily a 
statistical nuisance that must be controlled or 
suppressed by appropriate mathematical tech- 
niques. The agreement tendency is considered 
a rather superficial and ephemeral set, chiefly 
important as a mathematical factor that is 
compounded in scale measurements. While the 
statistical properties and consequences of the 


1 This investigation was supported in part by re- 
search grant M1287 from the National Institute of 
Mental Health, USPHS. It was conducted at the 
Harvard Psychological Clinic Annex as part of a 
project directed by Henry A. Murray 


response set are given prominent attention, 
little interest is focused on the psychological 
determinants of the agreeing set in the indi- 
viduals taking the tests. Cronbach, despite 
qualifications in other publications (1949, p. 
51), summarizes this position in a recent dis- 
cussion of accuracy of person perception: “In 
other instruments, the elevation score—the 
average response on all items—represents a 
trivial response set, such as the tendency to 
say ‘yes’ to both favorable and unfavorable 
questions. In that case, the dimension should 
probably be removed from the data” (1958, 
p. 366). 

A second approach is represented by several 
researchers interested in agreeing response set 
as a manifestation of the responders’ person- 
ality, interests, or personal “style” (Berdie, 
1943; Jackson & Messick, 1958). Many of 
these investigators, though they view response 
set as a reflection of personality, postulate that 
agreement tendency is merely what it 
“appears” to be—a general “acquiescence” 
tendency that results in agreement with the 
rather general and authoritative statements of 
most questionnaires. Bass (1955) suggests that 
the response set is an indication of “social 
acquiescence”; while Leavitt, Hax, and Roche 
(1955) believe that the set is an “agreement 
with things authoritative.” This view of re- 
sponse set as general acquiescence has led to 
the conclusion that many well-studied variables 
such as ‘‘authoritarianism” can be primarily 
explained in terms of the agreement phe- 
nomenon. 

A somewhat different position is taken in 
the present study. Although response set is 
indeed a mathematical factor having major 
repercussions on objective test scores, exclu- 
sive concern with these statistical consequences 
may distract from the equally significant prob- 
lem of the psychological determinants of re- 
sponse set. The basic hypothesis of this study 
is that response set is a manifestation of a 
deep-seated personality syndrome—whose un- 
derlying determinants serve to explain the 
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phenotypical phenomenon of “acquiescence” 
or “agreement.” 

This study attempts to analyze agreeing re- 
sponse set as a personality variable—using 
both objective test measurements and a pro- 
gram of clinical assessment. The first part of 
this report outlines the development of a 
reliable and valid measure of agreeing response 
set as it manifests itself in questionnaire items. 
We then show that this measure has the 
stability and generality which would char- 
acterize it as an index of an underlying per- 
sonality dimension. This agreement measure 
was correlated against a series of objective 
test scales and items. The pattern of results 
suggested a set of hypotheses concerning the 
personality differences between individuals 
who tend to agree (yeasayers) and those who 
disagree (naysayers). The second part of this 
report presents a set of hypotheses about the 
personality dynamics behind the agreement 
tendency and attempts to validate these for- 
mulations by a clinical study of subjects with 
marked tendencies to “agree” and “‘disagree.” 
Results of a structured interview and sentence- 
completion test given to these “‘yeasayers” and 
naysayers enabled us to verify and conceptu- 
alize more clearly the inner dynamics behind 
this “agreeing” personality syndrome. A gen- 
eral interpretation and discussion of the re- 
sults obtained in both parts of the study is 
presented in the concluding section. 





Part I: QUESTIONNAIRE STUDIES 

The initial problem was to obtain a valid 
measure of response set, which we have de- 
fined as a general “tendency to agree or dis- 
agree with questionnaire items, regardless of 
their content.” It is obviously important that 
the measure of response set not be “‘con- 
taminated” by the undue presence of any 
specific cluster of content variables, as would 
certainly happen if the amount of agreement 
on any one or two standard psychological test 
scales were considered an adequate indicator of 
this general response tendency. 

In order to create an appropriate measure of 
agreement tendency that was not affected by 
“content” variables, we decided that the ques- 
tionnaire items included in it would have to 
fulfill three criteria: (@) the items should 
measure heterogeneous content variables from 
widely different psychological areas; (5) these 
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content variables should be selected so that the 
total set is psychologically “‘balanced”’ to insure 
an over-all diffuseness of content; and (c) the 
total number of individual items iu the over-all 
response measure should be very large (at least 
300) so that the general response tendency has 
sufficient opportunity to manifest itself by 
permeating the answer pattern in a consistent 
way. With these purposes in mind, we began 
our study. 


Development of the Over-all Agreement Score 
(OAS) 


Subjects. As part of the assessment procedure in the 
selection of Ss for a clinical research project, a large 
battery of objective personality tests was administered 
to 61 paid volunteer students from Harvard College. 
These Ss had been selected on formal criteria (male, 
sophomore, satisfactory grades) from 200 volunteers 
enrolled in an undergraduate course in the social 
sciences. 

Procedure. Most of the objective test items were 
answered in a series of three weekly sessions in the 
fall; a final group of tests was given later in the spring. 
The Ss gave responses to 681 items, using a scale of 
the Likert type. The response categories were given 
values from 1 to 7, and were worded as follows: 


(2) (2) (3) (4) 
Strongly Disagree Slightly No 
Disagree Disagree Answer 

(5) (6) (7) 

Slightly Agree Strongly 
Agree Agree 


The items included many from previously developed 
tests, such as Davids’ Affect Questionnaire (1955), 
the factorial scales from Couch and Bales’ Value 
Profile (1955), the Authoritarianism F Scale (Adorno, 
Frenkel-Brunswik, Levinson, & Sanford, 1950), Thur- 
stone’s Temperament Schedule (1949), Cattell’s 16 
P.F. Personality Inventory (Cattell, Saunders, & 
Stice, 1957), Rokeach’s Dogmatism Scales (1956), 
and Bass’ Social Acquiescence Scale (1956). In ad- 
dition to these tests, well over 200 new items 
were included in an attempt to develop factor scales 
for a variety of psychological concepts in the areas 
of orality, anality, introspectiveness, psychological 
inertia, depression, elation, superego integration, 
insightfulness, self-concept, and attitudes towards 
American culture. 

To control the effect of content variables in our 
measure of agreeing response set, we based it entirely 
on 360 selected items out of the total of 681 items of 
the Likert-scale type. In an attempt to fulfill our 
criteria for a response measure unaffected by specific 
content, we chose items from a large number (30) of 
heterogeneous scales that had “‘psychological” opposites, 
such as Trust-Distrust, Dependency-Self-sufficiency, 
Rationalism-Religiousness, Optimism-Pessimism, etc. 
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By including an equal number of items on “opposite” 
ends of each of the dimensions measured, we hoped to 
“cancel out” the effect of any content variable on the 
Over-al] Agreement Score. This diversity and “psycho- 
logical balancing” of content reduced the possibility 
that the measure of agreeing tendency would be unduly 
influenced by some extraneous pattern of personality 
variables—a total score over all items makes no 
psychological sense. To the extent that this “content 
cancelling” has been achieved, the sum score of all 
360 item responses is a pure measure of agreeing re- 
sponse set, that is, of the S’s tendency to agree (or 
disagree) with items regardless of content. This approach 
should also reduce the possibility that correlations 
between the agreement measure and other scales or 
items are produced by content variables unwittingly 
built into the measure itself. 

An Over-all Agreement Score (OAS) was computed 
for each S by taking the mean of their responses to the 
360 items. The range of scores was from 3.1 to 4.5, 
with a mean value of 3.9 and a standard deviation of 
0.3. The first sign of the importance of this measure 
was indicated by the high (+.85) Spearman-Brown 
split-half (even-odd) reliability of the entire 360 item 
scale. The OAS thus provides a reliable measure of 
agreeing response set as defined in this study. 


Generality of the OAS 


Having established the statistical reliability 
of the OAS, it is necessary to demonstrate that 
this measure has empirical characteristics 
which support our contention that it measures 
a generalized agreement tendency. Since the 
OAS was based on an arbitrary selection of 
items, we considered that it was necessary to 
test its generality by correlating it with the 
grand sum of all 681 items of the Likert type 
in our questionnaire battery. This “grand sum” 
correlated +.94 with the OAS, a high value 
even considering the 53% overlap of items. 
Although the larger number of items also 
appears to serve the function of guaranteeing 
heterogeneity and content “elimination,” in 
this study we have used the OAS as the scale 
that most fully meets our criteria for a valid 
measure of response set. 

To test the extent that response set varied 
from one testing session to the next, we com- 
puted the correlations between the sum of 
““content-cancelled” items (120) from the first 
two weeks of testing and the sum of “‘content- 
cancelled” items (240) from the third week of 
testing. The correlation coefficient was +.73, 
indicating that agreeing and disagreeing re- 
sponse set operates fairly consistently over 
time, and is not merely the product of the 
S’s momentary mood. 














TABLE 1 
CORRELATIONS OF RESPONSE CATEGORIES 
wit OAS 
: Correlations 
Lik Res P F : 

~Categeries “Qfean) GSD)” ith OAS 
Strongly Agree 4.8 3.3 + .41 
Agree 17.1 4.6 + .55 
Slightly Agree 25.5 5.4 +,23 
No Answer 0.2 0.6 —.10 
Slightly Disagree 22.8 6.0 —.23 
Disagree 21.8 4.9 — .54 
Strongly Disagree 7.9 4.0 — .38 





The OAS as a mean score indicates the 
average point along the Likert scale around 
which the S tends to center his responses. 
However, this mean score does not tell us 
what actual response category preferences are 
associated with high and low OAS scores. To 
investigate these relationships, we computed 
the percentage frequency with which the Ss 
choose each of the six Likert response categories 
and then correlated these figures with the OAS 
scores. The mean and standard deviation of 
these percentage frequencies are presented 
along with the correlation coefficients in 
Table 1. 

These results demonstrate that the OAS is 
most highly related to a response preference 
for the categories of Agree (in a positive way) 
and Disagree (in a negative way). These cor- 
relations are affected by the relative contribu- 
tion to the OAS of each response category—as 
reflected in the Mean Percentage Frequency. 
This “part-whole” effect may explain the 
lower correlations for Strongly Agree and 
Strongly Disagree, but since other response 
categories have about the same mean fre- 
quency, the uniquely high correlation of the 
Agree and Disagree categories point to a real 
relationship of response preference and the 
agreement tendency. The preference for defi- 
nite “agreement” responses supports our 
interpretation of the OAS measure. It should 
also be noted that the frequency of using dis- 
agreement categories is an equally prominent 
indicator of the agreement tendency—only the 
direction of scoring is reversed. Thus, to lapel 
the response tendency agreement is only an 
arbitrary preference on our part; the opposite 
disagreeing response tendency would be an 
equally appropriate conceptualization. 

Yeasaying and true-saying. If this tendency 
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disagree is a manifestation of a 
should also 


to agree or 
general response set, this set 
permeate objective tests that do not employ 
Likert-scale items but use response categories 
that are similat in psychological meaning to 
agreement an disagreement. In order to test 
the generality uf response set, we investigated 
the relationship of the OAS with the agreement 
(marking true) on the MMPI 
(Minnesota Miultiphasic Personality In- 
ventory). About six months after the initial 
testing sessions, the Ss filled out the entire 
566-item MMPI. As a measure of a “true- 
saying” response tendency, we computed a 
Sum True score for each S by counting the 
total number of True responses on all items. 
This score, though perhaps also reflecting dis- 
positions other than response set, had a cor- 
relation of +.64 with the OAS. A correlation 
of this magnitude indicates that response set 
Agree-Disagree 
answer 


equivalent 


influences answers both to 
Likert True-False 
patterns. 

Yeasaying and yes-saying. To test further 
the generality of agreeing response set in the 
objective test domain, we examined the char- 
acteristic response pattern of our Ss on another 
standard personality inventory—Cattell’s 16 
P.F. Test. This test consists of Forms A and B, 
with a grand total of 374 items, of which 217 
(about 60%) have an answer pattern which 
offers the choices: Yes, In Between, No. We 
counted the total number of times each S had 
marked Yes on those 217 items where that 
response was possible. This Sum Yes score was 
then correlated with the OAS, giving a correla- 
tion of +.56. This finding adds support to our 
hypothesis that agreeing response set is a dis- 
position of considerable influence and stability, 
and indicates again that the OAS functions as 
a reliable and valid measure of this general 


items and to 


agreeing tendency. 
Correlates of the OAS 

The results obtained in developing the Over- 
all Agreement Score, namely the high relia- 
bility, stability over time, and generality of 
agreeing response set, support the view that 
this set is the manifestation of a personality 
syndrome. We therefore proceeded to treat the 
OAS as a criterion measure of this agreeing 
response set, and correlated the score with a 
series of objective test domains. This step-wise 


search for the correlates of the agreement re- 
sponse set gradually built up a consistent and 
meaningful pattern which characterized the 
personality of individuals who tend to “agree”’ 
(or conversely “disagree’’) on questionnaires 
regardless of item content. 

Relationship to the MM PI. There are intrin- 
sic difficulties in the task of relating a measure 
of agreeing response tendency to any series of 
objectively measured personality variables; for 
the response tendency itself is an integral part 
of the measurement technique involved in assess- 
ing the personality variable. This is especially 
true when the variable measured uses items 
scored mainly in one direction, when, for ex- 
ample, some variable X is scored by counting 
all items marked Agree, True, or Yes. 

On the MMPI we have already noted that 
the tendency to mark True is highly related 
to the general agreeing response set. For each 
of the 32 MMPI scales’ that were computed 
for each S, a tabulation was made of the per- 
centage of items in the scale which count 
towards a high score on that scale when marked 
True. All scales were then correlated with 
the OAS. These correlations and the Percentage 
True counts are presented in Table 2. 

The analysis presented in Table 2 illustrates 
the very close relationship between the amount 
of agreement response component inherent in 
the scale as measured by the Percentage True 
count and the correlation of that scale with our 
measure of agreement tendency. Scales that 
rely primarily on True responses for a high 
score tend to have positive correlations with 
the OAS, while scales relying on False re- 
sponses for a high score tend to have negative 
correlations. In fact, over the 32 standard 
scales, these two measurements have a rank 
order correlation of +.86, indicating that 
scales with an unusually high (or low) per- 
centage of items scored True are signifi- 
cantly influenced by the agreeing response set. 

Because of this possible source of “‘con- 
tamination,” an interpretation of the psycho- 
logical implications of the correlations between 
the MMPI scales and the OAS measure re- 
quires caution. The pattern of scales having 
significant positive and negative correlations 
with the OAS clearly suggests pervasive psy- 

2 Fuller descriptions of each scale and appropriate 
reference are available in the article by Kassebaum, 
Couch, and Slater (1959). 
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TABLE 2 
CORRELATIONS OF THE MMPI ScALES WITH THE 
OVER-ALL AGREEMENT ScorE (OAS) 


Percent-| Correlation 


Scales age with OAS 
“True” N = 61) 
ST Sum True 100 +.64** 
Pr Prejudice 88 + .51** 
{ Anxiety 97 + .50°* 
Dp Dependency 86 + .48** 
X2 Impulsivity 90 + .46** 
Pt Psychasthenia 81 + .45** 
Ma Mania 76 +.41** 
Se Schizophrenia 74 + .38** 
Fm Feminine Masochism 66 + .33* 
Hs Hypochondriasis 33 + .29* 
D Depression 35 +.22 
F Validity Scale 69 + .20 
Do Dominance 25 +.15 
Pd Psychopathic 48 + .15 
Si Social Introversion 49 + .07 
Pa Paranoia 63 + .06 
Sp Social Ability 68 + .03 
Mf Feminine Interests 47 + .01 
XJ Social Presence 25 — .06 
St Social Status 44 — .07 
Hy Hysteria 22 — .08 
Oi Originality 16 — .26* 
Te Intellectual Efficiency 38 — .28* 
Lp Leadership 28 — .31* 
R Repression Denial 0 — .34** 
Es Ego Strength 37 — .35** 
Ac Achievement via Con 22 — .35** 
formance 
K Suppressor Scale 3 — .38** 
Rp Role-playing Ability 41 — .40** 
L Lie Scale 0 — .42** 
1i Achievement via Inde 19 — .44** 
pendence 
Re Social Responsibility 28 - .48** 
To Tolerance 10 — .52** 


5 level of significance (r = .26 


** 01 level of significance (r = .33 
chological differences between yeasayers and 
naysayers. However, in so far as these cor- 
relations are related to the Percentage True 
counts of the scales, the importance of the 
interpreted “content” of these MMPI vari- 
ables may seem to be decreased. With the ex- 
ception of the few scales with approximately 
50% True counts—such as (Je) Intellectual 
Efficiency, (Es) Ego-strength, and (Rp) Role- 
playing Ability—none of the variables with 
significant correlations are free of the “‘con- 
taminating”’ effect of response set on their level 
of relationship with the OAS—thus presenting 
difficulties in using the correlations to identify 
the “psychological” characteristics of the 
agreement response tendency. 

Nonetheless, it should be remembered that 


most MMPI scales were developed by the use 
of “criterion” groups, and therefore the “‘psy- 
chological” nature of agreement response set 
would have contributed to the empirical pat- 
tern of item responses that differentiated indi- 
viduals high vs. low on the variable of the scale. 
We can probably infer that the original popu- 
lations of Ss high and low on Prejudice, 
Psychasthenia, Mania, Achievement via Inde- 
pendence, and the other concepts of the em- 
pirical scales had personality differences related 
to an “‘agreement” or “disagreement’”’ response 
tendency. In other words, the Percentage True 
of the empirical scale reflects personality 
differences in the criterion groups that are re- 
lated to the response tendency. To the extent 
that this inference is valid, the correlational 
results can be helpful in identifying the person- 
ality correlates of response set, despite the 
apparent “contamination” of the scale scores 
with response set. 

Table 2 shows a number of significant cor- 
relations of MMPI scales with the OAS. The 
positive correlations of (X2) Impulsivity, (Dp) 
Dependency, and (Ma) Mania with the OAS 
suggest that individuals high on the agreeing 
tendency show signs of an extrovertive im- 
pulsivity. The centrality of this general dimen- 
sion seems borne out by the cluster of high 
negative correlations with the variables: 
(R) Repression-suppression, (Re) Responsi- 
bility, (Ai) Achievement via Independence, 
and (Ac) Academic Achievement via Con- 
formance. All these scales point to introvertive 
intellectual interests, inhibitions, and a control 
of impulsivity. Another dimension which 
seems indicated is that of ego strength. Yea- 
sayers are high on such scales as (A) Anxiety 
and (Pt) Psychasthenia, while naysayers ob- 
tain high scores on the scales of (Es) Ego 
Strength and the (K) Supressor scale. In terms 
of a recent factor analytic study of the MMPI 
(Kassebaum, Couch, & Slater, 1959) this pat- 
tern of results places the agreeing response 
tendency along the axis of a fusion factor con- 
ceived as Impulsivity vs. Intellectual Control. 
The fact that the (Pr) Prejudice scale has the 
highest correlation (+.51) with the OAS and 
(To) Tolerance scale the lowest correlation 
(—.52) may be primarily explained by this 
main factorial dimension. Further implications 
of these findings are best discussed after more 
of the correlates of the OAS are considered. 
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TABLE 3 


CORRELATIONS OF THE CATTELL 16 P.F. 
wit THE OAS Score 


SCALES 


Correlation 
Scale with OAS 
(N = 61) 


A —Warm, Sociable vs. Aloof, Cold YY ing 
B Intelligent vs. Dull 

Cc Mature, Calm vs. Emotional, Unstable 
E —Dominant, Aggressive vs. Submissive 

F —Enthusiastic, Talkative vs. Silent 

G —Conscientious vs. Undependable 

H —Adventurous vs. Timid 

I —Sensitive vs. Tough 

L —Suspecting, Jealous vs. Trustful 

M —Eccentric vs. Conventional 

N —Sophisticated vs. Simple 

O —lInsecure, Anxious vs. Unshakable 


LFTTT TTT Tt | ++ 
S 


Q1—Critical vs. Conservative, Accepting 02 
Q2—Self-sufficient vs. Dependent —.17 
Q3—Controlled, Exact vs. Lax, Unsure — .35** 
Q4—Tense, Excitable vs. Phlegmatic, | +.38** 


Poised 


* Significant at the .05 level 


** Significant at the .01 level 


TABLE 4 
CORRELATIONS OF THURSTONE’S TEMPERAMENT SCALES 
WITH THE OAS Score 


Scal — Correlation with 
caie ercentage “ YAS 
(Factor) Scored “Yes (N = 61) 
A—<Active 65 4+ 13 
V —Vigorous 100 + 01 
I —Impulsive 90 4. 19 
D— Dominant 70 +-.14 
E—Stable 35 — .38* 
S —Sociable 80 +-.10 
R—Reflective 95 +-.16 


* Significant at the .01 level 


Relationship to the Cattell and Thurstone Per- 
sonality Tests. In Tables 3 and 4, we havé pre- 
sented the correlations of the OAS with each of 
the 16 standard scales (Forms A and B com- 
bined) in the Cattell 16 P.F. Test and with 
the seven factor scales of Thurstone’s Tempera- 
ment Schedule. The percentage of Yes re- 
sponses is presented for the Thurstone scales. 
These figures are not appropriate for the Cat- 
tell test which uses a large number of (a) vs. 
(b) alternative choices as item responses. 

In this battery of tests, the highest positive 
correlations with the OAS are with Cattell’s 
scales (L) Suspecting, Jealous vs. Trustful, (O) 
Insecure, Anxious vs. Unshakable, and (Q4) 
Tense, Excitable vs. Phlegmatic, Poised. The 
highest negative relationships are with Cattell’s 


(C) Mature, Calm vs. Emotional, Unstable, 
(Q3) Controlled, Exact vs. Lax, Unsure. In the 
Thurstone test, there is a significant negative 
correlation with the (E) Stable temperament 
scale—which also has the lowest percentage of 
Yes responses involved in its score. We note 
again the theme of emotional control vs. im- 
pulsivity. The naysayers present a picture of 
calm, controlled stability; while yeasayers seem 
more emotionally determined, openly admit- 
ting anxiety and distrustfulness. The highest 
correlation (+.39) is with the Suspecting, 
Jealous vs. Trustful Scale, indicating that a 
lack of trust is associated with the OAS. This 
datum reminds us that one should not too 
quickly generalize from the apparent “ac- 
quiescence” of the yeasayer. 

It is interesting to note the lack of relation- 
ship of the OAS to intelligence as measured by 
the (B) Intelligent vs. Dull scale (+.07). This 
pattern was verified by nonsignificant correla- 
tions of the OAS with SAT (Verbal Aptitude 
Subtest) (—.04) and MAT (Mathematical 
Aptitude Subtest) (.00) of the College Entrance 
Examination Board scores. In our population, 
it appears that intelligence is not an important 
aspect of agreeing response set. 

Relationship to Basic Disposition Scales. As 
part of other research at the Harvard Psycho- 
logical Clinic Annex, we had developed a set 
of factorial scales to measure basic dispositions 
in the areas of Orality, Anality, Alienation, 
and Psychological Inertia. In a series of unre- 
ported studies large domains of test items in 
each of these four areas were factor analyzed. 
Several reliable factor scales were developed, 
each possessing the statistical characteristics of 
factorial purity and domain orthogonality. For 
some variables, both Positive and Balanced 
scale measures are available. These latter scales 
are “balanced” in the sense that an equal 
number of items are scored “positively” and 
“negatively.” Thus, Balanced scales should 
serve to cancel out any contamination due to 
agreeing response set. The Positive factor 
scales, reported below, consist of items scored 
only in one direction. Table 5 presents the 
correlations of the OAS with these Basic Dis- 
position Factor Scales, subdivided into Bal- 
anced and Positive scales. 

This table permits us to compare the cor- 
relations of Balanced and Positive scales de- 
signed to measure the same concept. In every 
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TABLE 5 


CORRELATIONS OF THE Basic Disposition Factor 
SCALES WITH THE OAS 


Correlations 
Basic Disposition Factors with OAS 
(N = 61) 
Positive Scales 
Conventional Anxiety +.57** 
Conscious Hostility + .46** 
Alienation +.44** 
Distrust + .32* 
Pessimism +.18 
Optimism +.17 
Tr — .07 
Oral Aggression + .43** 
Oral Dependency + .40** 
Oral Involvement -+.14 
Oral Rejectiveness + .03 
Ora! Counterdependency — .03 
Anality Total Scale +.75** 
Anal Preoccupation +.75** 
Anal Submission + .67** 
Anal Rebellion +.61** 
Anal Concern with Power +.55** 
Anal Orderliness +.25 
Anal Messiness + .20 
Secondary Process Inertia—Excitability +> .s7°° 
Low Emotional Inertia +.56** 
Equilibrium Inertia—Emotional Variabil-| + .48** 
ity 
Inconstancy +.37** 
Quick Reactions + .33* 
Perseveration +.17 
Slow Reactions — .02 
High Psychological Inertia — .20 
Balanced Scales 
Distrust vs. Trust +.20 
Optimism vs. Pessimism +.01 
Oral Dependency vs. Counterdependency| +.25 
Anal Orderliness vs. Messiness + .07 
Emotional Inertia: High vs. Low — .43** 
Equilibrium Inertia: Emotional] Stabil- — .33* 
ity vs. Variability 
Secondary Process Inertia—Impulse — .30* 
Control vs. Excitability 
Quick Reactions vs. Slow Reactions +.20 
Inconstancy vs. Perseveration +.10 


* Significant at the .05 level. 
** Significant at the .01 level 


case where the comparison is possible, the cor- 
relations of Balanced scales with the CAS are 
less positive than that of their Positive scale 
counterparts, the average drop being .18. This 
difference is of course statistically determined 
by the fact that the agreeing response set is 
actually built into all Positive scales and 
systematically cancelled from Balanced scales. 
The comparison again points out the important 


contribution that agreeing response tendency 
makes to the variance of Positive scales, and 
demonstrates that “balancing-out” response 
set reduces the response set component in the 
scale score. 

With this effect of response set on the vari- 
ables kept in mind, the pattern of correlational 
results in Table 5 brings out new facets of the 
agreeing response tendency. Several of the 
recently developed Basic Disposition Factor 
scales have significantly positive correlations 
with the OAS. Many of these scales were de- 
rived from our factor analysis of the Anality 
domain. Since these scales are not yet described 
in the literature, we have included here a char- 
acteristic item for the scales mentioned. It can 
be seen that the highest positive correlations 
are for the dispositional scales of Anal Pre- 
occupation (“I enjoy exploring slums and 
alleys, dirty though they are”); Anal Rebellion 
(“I could really shock people if I said all of the 
dirty things I think’); and Anal Submissive- 
ness (“People need someone strong to lean 
on’’). These results suggest that the previously 
noted relationships of the OAS to impulsivity 
and rejection of emotional control may be part 
of a more basic dispositional complex centered 
around anality. We also note a positive rela- 
tionship to the scales measuring Alienation 
(“The idea of adjusting to society as now 
constituted fills me with horror’); and Con- 
ventional Anxiety (“There are few more 
miserable experiences than going to bed night 
aiter night knowing you are so upset that 
worry will not let you sleep”). These variables 
seem related to the general maladjustment and 
distrustfulness noted in the MMPI scales and 
in the Cattell factors. 

On the opposite end of theagreeing tendency, 
a group of Balanced scales in the area of psycho- 
logical inertia is found to characterize the 
naysayers. The factor scales with the highest 
negative correlations with the OAS are: 
Secondary Process Inertia (“I have to think 
things through thoroughly before I act’’); 
Equilibrium Inertia (‘Uncontrolled impulsive- 
ness is not part of my make-up’); and Emo- 
tional Inertia (“Whatever emotional mood I 
have is likely to stay with me for a long time’’). 
This pattern of general retention of impulses 
and the reluctance to respond quickly to situ- 
ational demands may very well represent the 
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opposing reaction to the anal expressive 
syndrome pointed out by the cluster of posi- 
tively correlated scales, and contribute a new 
meaning to the general emotional stability 
indicated by the previously reported correla- 
tional results. It may be that standard meas- 
ures of ego strength, adjustment, and maturity, 
when used on populations of normals, are 
related primarily to this kind of pervasive con- 
trol of impulses by dominant secondary 
processes. 

Relationship to Authoritarianism. Much of 
the previous research on agreeing response set 
has focused on its relationship with authori- 
tarianism. It has been argued that scores on 
the F scale are in large part a function of this 
agreeing tendency, with the implication that 
the original formulations concerning the au- 
thoritarian personality may be invalidated by 
this fact. To investigate this question, we made 
an especially intensive study of the relation- 
ship between the F scale and the OAS. 

The literature on authoritarianism and 
“acquiescence” often concentrates on the 
question of what percentage of F scale variance 
can be attributed to the agreeing response 
tendency. With the OAS, a fairly direct meas- 
ure of this percentage can be obtained from the 
correlation of the OAS with the F scale. But 
many objective measures of authoritarianism 
suffer from a lack of knowledge of their dimen- 
sional purity. In an effort to avoid these 
difficulties, we used as measures of authoritari- 
anism two scales that had been developed by 
Couch and Bales ('955) in their factor analysis 
of the Value Profile. These scales are factorially 
pure measures of authoritarianism, both having 
very high saturation on the first factor of their 
analysis which uniquely defines a 30-item F 
score with a loading of +.90. One scale, called 
the Positive F scale, consists of 10 items all 
scored in the positive direction: for example, 
“Obedience and respect for authority are the 
most important virtues children should learn.” 
The other scale, the Balanced F scale, consists 
of 5 items scored positively and 5 items scored 
negatively. The factorial purity of this scale 
is achieved by a balancing out of other factor 
components. Thus, negative items that are not 
pure “reversed” authoritarianism can be used 
when properly compensated for by complemen- 
tary factorially “impure’’ positive items. With 
this Balanced F scale, agreeing response set is 


TABLE 6 


CORRELATIONS OF AUTHORITARIANISM MEASURES 
WITH THE OAS 





Correlations with 


Factor Scales OAS (N = 61) 


Positive F scale + .37* 
Balanced F scale + .09** 
* Significant at the .01 level. 
** Significantly different from the above r = +.37 at the 
01 level. 


effectively cancelled out; thus, the correlation 
with the OAS should reflect the lack of this 
“contaminating” influence. The results of 
correlating these two factorially ‘‘equal’’ meas- 
ures of authoritarianism with the OAS are 
presented in Table 6. 

As Table 6 indicates, the Positive F scale 
correlates significantly with the OAS, sup- 
porting the view that the F scale, if measured 
by positive items, is influenced to some extent 
by agreeing response set. But the percentage of 
variance attributed to this agreement effect 
is only 14%, if we assume that the correlation 
with the OAS is an appropriate measure of this 
influence. More substantial evidence that 
“pure” authoritarianism is actually uneffected 
by agreement tendency comes from the nearly 
zero (+.09) correlation between the Balanced 
F scale and the OAS. Since both F scales are 
highly loaded on the authoritarianism factor 
and are significantly intercorrelated (+.56), 
these results suggest that the agreeing response 
tendency is not related to “pure” authoritari- 
anism when its measurement is not inherently 
contaminated by response set. In another 
section, we return to this issue in a more sub- 
stantive context where a factor analysis sup- 
ports the same conclusion. 

Relationship to Social Acquiescence and Dog- 
matism. We turn to the areas of social acquies- 
cence and dogmatism where previous research 
had suggested certain relationships to response 
set. Bass (1956) developed a Sociai Acquies- 
cence Scale in a manner somewhat similar to 
the method we used in obtaining our Over-all 
Agreement Score, although only aphorisms 
were involved in his scale. One-half of Bass’ 
scale (28 odd-numbered items) was included 
in our original test battery. In addition to these 
items, we gave Rokeach’s (1956) Dogmatism 
Scales, which consist of 40 items measuring 
different aspects of ideological and functional 
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TABLE 7 
CORRELATIONS OF SOCIAL ACQUIESCENCE 
AND DOGMATISM WITH THE OAS 


Correlations with OAS 
NV = 61) 


Scales 


Bass’ Social Acquiescence Scale +.30° 


Rokeach’s Dogmatism Scales 


Structure scale + .26* 
Content scale + .29* 
Function scale + .42** 
Total scale + .40** 


* Significant at the .05 level 
** Significant at the .01 level 


dogmatism. Table 7 presents the correlations 
of these scales with the OAS. 

The Social Acquiescence Scale correlates 
significantly (+.30) with the OAS, but at a 
lower level than would be expected if “‘acqui- 
escence”’ were actually a central variable in the 
agreeing response tendency. This rather low 
correlation is especially indicative, since all 
items are scored in the direction of agreement. 
Given the general pattern of positive correla- 
tions between other Positive scales and the 
OAS, this result suggests that the Social 
Acquiescence Scale is not distinctively related 
to agreeing response set. 

With the Dogmatism Scales, we found the 
highest correlation between the Function 
Scale and the OAS (+.42). The items here are 
of the type: “I would like it if I could find 
someone who would tell me how to solve my 
personal problems.” This scale attempts to 
measure the underlying function of dogmatism 
in the individual’s personality, especially in 
controlling the fear of aloneness and appre- 
hension about the future. The relationship 
between the Dogmatism Scale and the OAS 
appears to reflect the diffuse anxiety of the 
yeasayer, and to suggest that he seeks de- 
pendency on external figures for support. 


Development of the Agreement Response Scale 
As the next stage of our research we cor- 
related the OAS to each of the 681 items in 
our questionnaire battery. The item-to-OAS 
correlations ranged from —.21 to +.54; the 
mean correlation for all items was +.17. The 
high mean correlation demonstrates again the 
extent to which response set has influenced 
all item responses. This trend was also ap- 
parent from the fact that 88% of all items 


correlated positively with the OAS. Well over 
100 items had correlations with the OAS 
greater than +.30. Table 8 lists a sample of 
items with the highest positive relationship to 
the response tendency, along with the item 
means, standard deviations, and correlations 
with the OAS. 

These items with the highest positive cor- 
relations serve to identify the agreement 
response tendency on the most specific level 
of separate statements. Since the average of 


TABLE 8 
CORRELATIONS OF INDIVIDUAL ITEMS WITH THE OAS 
SCORE 
Correlation 
with OAS Mean| SD Items* 
(N = 61) 
+ .54 4.9 1.4 | Novelty has a great appeal 
to me. 

5 4.5 | 1.7 | I crave excitement. 

51 3.5 | 1.6 | It’s a wonderful feeling to 
sit surrounded by your 
possessions. 

.48 3.3 | 1.6 | There are few things more 
satisfying than really to 
splurge on something— 
books, clothes, furniture, 
etc. 

47 4.2 | 1.7 | Only the desire to achieve 
great things will bring a 
man’s mind into full ac- 
tivity. 

46 3.6 | 1.6 | Nothing is worse than an 
offensive odor. 

45 3.6 | 1.3 | In most conversations, I 
tend to bounce from topic 
to topic. 

44 2.9 | 1.6 I really envy the man who 


can walk up to anybody 
and tell him off to his face. 

44 4.1 | 1.8 | I could really shock people 
if I said all of the dirty 
things I think. 

42 4.3 | 1.7 | There are few more miser- 
able experiences than 
going to bed night after 
night knowing you are so 
upset that worry will not 
let you sleep. 


.42 3.8 | 1.6 | I tend to make decisions on 
the spur of the moment. 
41 3.8 | 1.8 | Little things upset me. 


w 


41 3.0 | 1.3.| Drop reminders of yourself 
wherever you go and your 
life’s trail will be well re- 
membered. 

.40 2.1 | 1.3 | I like nothing better than 

having breakfast in bed 

40 4.2 | 1.6 | My mood is easily influ- 

enced by the people 
around me. 


®* The 15 items listed here we now consider the best short 
scale measure of the agreeing response tendency. 
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these 15 item means (3.7) and of item standard 
deviations (1.6) is almost identical to the 
average mean for all 681 items (3.8) and their 
average standard deviation (1.5), no purely 
statistical factors are determining the correla- 
tional results. The items reflect to a great 
extent the same array of variables that we 
have already noted—impulsivity, anality, and 
instability. A more detailed analysis of these 
implications is presented in the final stages of 
the scale development. 

A selection of 20 items with the highest cor- 
relations to the OAS constituted our first 
approximation to an Agreement Response 
Scale (ARS). The split-half reliability of this 
scale was very satisfactory (+.86). With all 
items being scored in a positive direction, the 
response set is, of course, working in our 
favor—as is uniquely desired in this case. 
This 20-item scale (ARS) correlates +.79 
with the OAS, thus giving it considerable 
criterion validity. 

However, it is obvious that all of the statisti- 
cal uniqueness of our single sample is contribut- 
ing to this pattern of results for the ARS. 
We felt it necessary to test the internal con- 
sistency of the tentative scale by administering 
it to an independent population. A 120-item 
questionnaire was developed which included 
this ARS scale, as well as an array of items 
that we hoped would bring out new facets of 
the agreeing tendency. We also included a 
number of standard F scale items and several 
new ones written to measure “low” authori- 
tarian attitudes. 

The questionnaire was given to 54 students 
in an upper-level psychology course. For this 
new sample, the split-half reliability of the 
ARS remained quite high (+.72), indicating 
that our measure had the characteristics of 
internal consistency. As a next step in scale 
development, we did a factor analysis of the 
120 items. These items were intercorrelated 
and the resulting matrix factor analyzed by 
the Thurstone complete centroid method. Four 
factors were extracted, which altogether ac- 
counted for 28% of the item domain variance. 
By Thomson’s (1939) Pooling Square tech- 
nique, we computed the factor loadings of the 
total 20-item Agreement Response Scale, and 
then made orthogonal rotations so as to define 
the ARS by the first factor. 


TABLE 9 
Factor Loaprincs oF ITemMs ON AGREEMENT 
RESPONSE SCALE DIMENSION 


Factor I 


Loading Item 

+.66 Let us eat, drink, and be merry, for tomorrow we die 

+.63 I tend to make decisions on the spur of the moment 

+.61 There are few things more satisfying than really to 
splurge on something—books, clothes, furniture, 
etc. 

+. 57 Here today, gone tomorrow—that’s my motto! 

+. 50 Novelty has a great appeal to me 

+.49 My feelings about others fluctuate a good deal 

+ .48 Conscience is another name for fear. 

+.46 Movement, travel, change, excitement—that’s the 
life for me! 

+.45 I’m apt to really blow up, but it doesn’t last long 

+.42 It’s great fun just to mess around 

+.41 There’s nothing so satisfying as really to tell some- 
one off 

+.40 I really enjoy plenty of excitement 

—.61 One should not give free rein to the passions, but 


rather control and weigh them before expressing 


them 

—.51 I seldom if ever lose my temper 

-.49 Uncontrolled impulsiveness is not part of my make- 
up 

—.37 It’s hard to get me upset 

—.34 My speech is quite slow and deliberate 

—.33 I feel uncomfortable when people get too emotional. 

33 I almost never respond impulsively to people or 

events 

+. 86 Agreement Response Scale 


With the assumption that the ARS re- 
mained a valid measure of agreeing response 
set, the first factor of this new analysis may be 
considered a dimensional representation of 
our central variable. In Table 9 we have listed 
those items with the highest positive and nega- 
tive loadings on Factor I, which might best be 
labelled the “Agreement Response Scale” 
dimension. 

A difference in emotional responsiveness 
seems to be the central concept of the agree- 
ment dimension. Items with high positive 
loadings on this Agreement Response Scale 
factor tend to emphasize an impulsive accept- 
ance of immediate internal stimuli (“I’m apt 
to really blow up, but it doesn’t last long”’) 
and the desire for external stimuli in the form 
of emotional sensations and _ thrill-seeking 
(“Novelty has a great appeal to me’’). This 
pattern might be summarized by the term 
“stimulus acceptance,” indicating a tendency 
to enjoy and yield to immediate, situational, 
short-term forces of both an internal and ex- 
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ternal nature. The items with high negative 
loadings stress just the opposite, namely, 
rejection of these same emotional and en- 
vironmental pressures. The central theme of 
the naysaying position is the effort to control 
and minimize internal and external forces 
seeking expression. Naysayers emphasize ego 
control of libidinal impulses (“One should not 
give free rein to the passions, but rather control 
and weigh them before expressing them”) as 
well as delay and inhibition of responses to 
provocative press in the environment (“I 
am not easily provoked’’). 

Importance of Item Tone. An examination of 
items with high loadings on the agreement 
factor suggested the importance of item tone in 
the distinction between yeasaying and naysay- 
ing characteristics. The positive items indicated 
that the yeasayers’ impulsiveness was revealed 
most clearly in statements whose ttem tone 
was extreme and enthusiastic. Exclamatory 
phrases and colloquial slang seemed to have 
intrinsic appeal to the personality of the yea- 
sayer: for example, “Here today, gone 
tomorrow—that’s my motto!” and “It’s great 
fun just to mess around!” On the opposite 
end of the agreeing response dimension, we 
noted that items with negative loadings were 
phrased in guarded, qualified, and cautious 
language. Items such as “I seldom if ever lose 
my temper,” and “I almost never respond 
impulsively to people or events,” point out the 
degree that “careful,” “realistic,” and in- 
tellectually qualified phrasing are inherent in 
the item tone of naysaying items. We feel that 
this aspect of item phraseology is highly related 
to the agreement response tendency on ques- 
tionnaires. If the item has an enthusiastic, 
colloquial tone, it will serve to attract the 
yeasayer and to repel the naysayer. On the 
other hand, a qualified and hesitant statement 
will appear reasonable and bring forth agree- 
ment from the generally disagreeing naysayer, 
while provoking the generally effusive yeasayer 
to one of his infrequent disagreements. Al- 
though the psychological content of ‘“expres- 
sion” vs. “control” of emotional impulses 
certainly influences the language of the state- 
ments used for measurement, the characteristic 
phraseology of items high on this factor sug- 
gests that i/em tone itself contributes to eliciting 


special agreement from yeasayers and dis- 
agreement from naysayers. 

The factor analysis serves to support the 
conclusions we had arrived at from the previous 
investigation of the empirical correlates of 
the Over-ail Agreement Score in the realm of 
objective personality scales. The factor dimen- 
sion of the Agreement Response Scale shows a 
conceptual similarity to the pattern of 
variables that were highly correlated to 
the OAS. 

Factorial Relationship of Agreement Response 
Scale to Authoritarianism. A number of items 
in the area of authoritarianism were included 
in this factorial study in order to clarify further 
the relationship of agreeing response tendency 
to this variable. In addition to the regular 
“positive” F scale items, we added several new 
items designed to measure “low” authoritari- 
anism in an agreement direction. The second 
factor was rotated through the F scale meas- 
ures. This two-dimensional factor pattern of 
Agreement and Authoritarianism is presented 
in Fig. 1. 

The factor plot of item loadings on Factors 
I and II shows that the dimension of the 
Agreement Response Scale was orthogonal to 
the dimension of authoritarianism. The Posi- 
tive F scale items were scattered around the 
plus end of Factor II, while our new Negative 
F scale items defined the opposite pole. Several 
items were loaded on beth main factors and 
these served to define the “fusion” locations 
of the factor space. The ARS items were split 
into two types by the authoritarianism dimen- 
sion; those expressing a Desire for External 
Stimulation were associated with high authori- 
tarianism, while items favoring an Expression 
of Unacceptable Internal Impulses were associ- 
ated with low authoritarianism. On the oppo- 
site end of Factor I, items measuring the Dis- 
agreement tendency were also divided into two 
different types. High authoritarianism was re- 
flected in items favoring a Control of Unac- 
ceptable Internal Impulses; while low authori- 
tarianism was reflected in items favoring an 
Avoidance of External Stimulation. 

The factor pattern of loadings illustrated 
in Fig. 1 supports our earlier finding that 
“pure” authoritarianism is independent of the 
agreeing response set. The measure of the 
agreement tendency, the Agreement Response 
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Fic. 1. Schematic representation of “agreement’’ and “authoritarianism’”’ factors 


Scale, uniquely defines the first dimension 
that is orthogonal to the second dimension of 
“authoritarianism Although some of the 
agreement items had a component of desiring 
stimulation, while others reflected expressing 
unaccepiable impulses, these items taken to- 
gether balanced out the second factor and left 
the ARS a “pure” measure of response set. 
The very fact that the ARS contains these two 
complementary external and internal com- 
ponents supports our previous interpretation 
that Stimulus Acceptance vs. Stimulus Rejec- 


” 


tion represents the more abstract central con- 
cept involved in agreement response set. 
Development of “Low” F Scale. Our research 
on the agreement response tendency suggested 
a new approach for the construction of a “re- 
versed” or “low” F scale. It is clear that if 
items on opposite ends of the authoritarian 
dimension all contain a substantial component 
of the response set variable, then these “‘oppos- 
ing” scales will tend toward a positive rela- 
tionship regardless of the reversal of authori- 
tarian content which would tend to cause a 
negative relationship. Other researchers (Bass, 


1955; Jackson & Messick, 1958) have tried 
this type of “content reversal” with only 
limited success in obtaining negatively related 
scales. Our research has pointed out the impor- 
tance of “item tone”’ in the agreement response 
tendency. Since the phraseology of an item 
contributes to the amount of agreement or 
disagreement which it attracts, we hypothe- 
sized that by manipulating the item tone of 
low authoritarian items so as to attract 
naysayers and repel yeasayers, we could 
increase the negative relationship between the 
“high” and “low” authoritarian scales. Several 
“nonauthoritarian” statements were therefore 
phrased in a “naysaying manner” that empha- 
sized qualification and a lack of exclamatory 
enthusiasm. The five items of this kind with 
the highest negative loadings in the previous 
factor analysis formed a new “Low F scale.” 
These ‘‘naysaying” type items are listed below. 


“‘Navsaying”’ Low F scale 


1. People tend to place too much emphasis on 
respect for authority. 
2. It may well be that children who talk back to 
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their parents actually respect them more in the long 
run. 

3. I seldom have any enthusiasm for respect and 
obedience for authority. 

4. It would be preferable if there were less en- 
thusiasm about strict discipline, rugged determination, 
and the will to work and fight for family and country 


5. I would not myself consider patriotism and 
loyalty to be the first requirements of a good citizen 


We combined this 5-item “naysaying”’ Low 
F scale with five items of the Positive F scale 
to form a new 10-item Balanced F scale. Thus, 
for this second population we had three dif- 
ferent measures of authoritarianism: (a) the 
previously used (10-item) Positive F scale, 
(6) a new (10-item) Balanced F scale, and 
(c) a new (5-item) Low F scale. The correla- 
tional results indicated that we had been 
quite successful in constructing a “better” 
reversed F scale than has previously been 
reported. Our Naysaying Low F scale cor- 
related highly negatively (—.70) with the 
Positive F scale. These “low” items also im- 
proved the Balanced F scale, as evident from 
its extremely high correlation (-++.93) with the 
Positive F scale. 

The correlations of these three F scales 
with the Agreement Response Scale are pre- 
sented in Table 10. 

None of the correlations of the F scales 
indicate any significant relationships to the 
ARS—turther supporting our view that these 
dimensions are unrelated. We were puzzled 
that in this new sample the Positive F scale 
had no positive correlation to our new response 
tendency measure. Population differences 
may be involved, but other evidence, such 
as the slight positive relationships of “low” 
F scale items to the Agreement Response Scale, 
suggested that the new response scale was 
slightly “rotated” from the position expected 
from our original OAS criterion. But the ARS 
still seems quite adequate® as a “short” sub- 
stitute measure; the general pattern of results 
reflects all the previous findings with the 
OAS measure. 

Considering the correlations between re- 
sponse measures and authoritarian scales in 
our two samples, and also considering the 
orthogonality of the variables in the factor 
analysis of items, the best conclusion is that 

* Additional validation of the Agreement Response 


Scale is provided by the correlation of +.61 with the 
sum of all 120 items in the second questionnaire battery 


TABLE 10 


CORRELATION OF VARIOUS F SCALES WITH 
AGREEMENT RESPONSE SCALE 











ae Canyon b> ARS 
Positive F scale +.00 
Balanced F scale (new) — .04 
Low F scale (new) +.16 





“pure” authoritarianism has little or no 
relationship to the agreeing response tendency. 


PRELIMINARY FORMULATION OF PERSONALITY 
DETERMINANTS FROM QUESTIONNAIRE 
RESULTS 


From our investigation of the agreeing 
response tendency in the questionnaire realm, 
we have arrived at a fairly consistent picture 
of the variables that differentiate yeasayers 
from naysayers. Yeasayers seem to be “‘id- 
dominated” personalities, with little concern 
about or positive evaluation of an integrated 
control of their impulses. They say they express 
themselves freely and quickly. Their ‘“‘psycho- 
logical inertia” is very low, that is, very few 
secondary processes intervene as a screen 
between underlying wish and overt behavioral 
response. The yeasayers desire and actively 
search for emotional excitement in their en- 
vironment. Novelty, movement, change, ad- 
venture—these provide the external stimuli 
for their emotionalism. They see the world 
as a stage where the main theme is an “acting 
out” of libidinal desires. In the same way, 
they seek and respond quickly to internal 
stimuli: their inner impulses are allowed ready 
expression. They have little tolerance for a 
delay of gratification: immediate reduction is 
sought for any tension that may have arisen. 
Thus, to both external and internal stimuli, 
the yeasayers’ general attitude is one of stimu- 
lus acceptance, by which we mean a pervasive 
readiness to respond affirmatively or yield 
willingly to both inner and outer forces 
demanding expression. 

The “disagreeing” naysayers have the op- 
posite orientation. For them, impulses are 
seen as forces requiring control and perhaps, 
in some sense, as threats to general personal 
stability. The naysayer wants to maintain 
inner equilibrium; his secondary processes are 
extremely strong in this ego maintenance of 
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the balance between emotionally impulsive and 
value-maintaining forces. We might describe 
this as a state of high psychological inertia— 
impulses undergo a series of delays, censor- 
ships, and transformations before they are 
permitted expression. Both internal and ex- 
ternal stimuli that demand response are 
carefully scrutinized and evaluated: these 
forces appear as unwelcome intruders into a 
subjective world of “‘classical” balance. Thus, 
as opposed to the yeasayers, the naysayers’ 
general attitude is one of stimulus rejection—a 
pervasive unwillingness to respond to im- 
pulsive or environmental forces. 

This formulation of the dynamics of the 
response tendency was developed from an 
analysis of the empirical correlates of the 
Over-all Agreement Score. As such, the formu- 
lation suffers from all the limitations of psycho- 
logical interpretation based on objective paper- 
and-pencil tests. Since the response tendency 
itself influences the objective test measure- 
ments by which we initially identified its na- 
ture, there were in this case additional reasons 
for further investigation. We felt that our 
study of the personality dynamics and deter- 
minants of agreeing response set would benefit 
from a clinical analysis of extreme yeasayers 
and naysayers. Therefore, a clinical study was 
designed to validate these theoretical formu- 
lations about the personality syndrome deter- 
mining this general response set. 


Part II: CrirntcaL Stupy oF YEASAYERS AND 
NAYSAYERS 


The research in Part I led us to the formu- 
lation that a personality syndrome may help 
to explain the psychological dynamics of 
agreeing response tendency. It seemed evident 
that for many persons the tendency to agree 
or disagree was actually a significant and 
functional part of their personality structure. 
The pattern of variables correlated with our 
two measures of the agreement tendency 
(CAS and ARS) suggested a set of hypotheses 
concerning the deeper dynamics and deter- 
minants of this pervasive response set. A 
clinical assessment of Ss on the extreme ends 
of the dimension would allow us to test the 
main theoretical tenets of this “personality 
syndrome” view of the agreement response 
tendency. 


Hypotheses about Personalty Differences 


Specific predictions were made about dif- 
ferences between yeasayers and naysayers in 
seven major areas: (A) External vs. Internal 
Orientation; (B) Id-ego-superego Integration; 
(C) Impulse Control; (D) Hostility and 
Distrust; (E) Intellectual Orientation; (F) 
Secondary Process; and (G) Conflict Mani- 
festations. These hypotheses were tested by a 
research design employing a sentence comple- 
tion test and clinical interviews to be described 
below. 


(A) External vs. Internal Orientation. The yeasayer 
is externally oriented with strong tendencies towards 
extraception and extraversion; the naysayer is in- 
ternally oriented, intraceptive, and introverted. 

A-1 Yeasayers have superficial, conventional iden- 
tifications; naysayers’ identifications are deeper, 
less conventional, and defined more by internal 
values. 

A-2 Yeasayers are anti- or nonintraceptive; nay- 
sayers are intraceptive and psychologically 
minded. 

A-3 Yeasayers are oriented outward, towards the 
group values; naysayers are oriented inward, 
towards their own values. 

A-4 The self-concept of yeasayers is behaviorally 

defined and evaluated by conventional stand- 

ards; that of the naysayers is subjectively 
defined and evaluated by personal standards. 

5 Yeasayers have little concern with values 
and ideals, which if held at all are conventionally 
realistic and superficial; naysayers are con- 
tinuously influenced by values which are felt 
as an integral part of themselves. 

(B) Id-ego-superego Integration. The personality 
structure of yeasayers has a high degree of com- 
partmentalization of id-ego-superego; naysayers have 
a more integrated personality with much stronger ego 
control. 

B-1 The behavior of yeasayers is determined more 
by external factors in the immediate situation; 
naysayers are determined more by internal 
control and response to inner subjective reflec- 
tions. 

B-2 Yeasayers tend to externalize the source of 
their problems; naysayers tend to feel that 
they themselves are to blame for their problems. 

B-3 Ego strength is low in yeasayers; high in 
naysayers. 

B-4 Yeasayers tend to have weak, externalized 
superegos; naysayers are more guilt prone, 
having strong, internalized superegos. 

B-5 Yeasayers show a low perseveration of mood 
and emotional cathexes; naysayers have a 
high perseveration of their feelings and attach- 
ments. 

(C) Impulse Control. Yeasayers are able to express 

their emotions and impulses freely and openly without 
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inhibition; naysayers tend to suppress, repress, and 
control their impulses by strong ego domination. 

C-1 Yeasayers express aggression and _ hostility 
immediately and openly; naysayers express 
hostility mostly in a covert way, while open 
aggression is extremely controlled. 

C-2 Yeasayers express affection, sex, tenderness 
more directly and easily than naysayers who 
are more constricted, inhibited, and repressed 
in the expression of libidinal impulses. 

C-3 Yeasayers have anal expulsive characteristics 
and respond to “anally colored” situations 
with typical expressive reactions; naysayers 
have anal retentive characteristics and show 
typical controlled reactions. 

C-4 Yeasayers were socialized by punitive, physical 
methods; naysayers were socialized by depriva- 
tion of love and rational persuasion. 

C-5 Yeasayers tend to have emotionally intense, 
“Dionysian” expressive behavior; naysayers 
stress an avoidance of strong feelings and favor 
a balanced, “Apollonian” form of expression. 

(D) Competition and Distrust. The yeasayer tends 
to admit a distrust of others’ altruism and to accept 
competition as a “realistic” part of life; naysayers 
have a “reactive” trust of others and try to minimize 
competition and conflict. 

(E) Intellectual Outlook. The intellectual orientation 
of the yeasayers tends to be loose and emotional; 
naysayers are more consistently intellectual and have 
a high regard for rationality. 

(F) Secondary Process. The yeasayer tends to have 
a minimum degree of secondary processes intervening 
between inner “stimulus” and manifest “response”’; 
the naysayer has long secondary processes causing 
considerable delay in emotional responsiveness. 

(G) Conflict Manifestations. Yeasayers show more 
evidence of conflict, ambivalence, and disparity between 
values and behavior than naysayers. 


Method in Clinical Study 


Subjects. From our original population of 61 Ss, the 
10 with the highest scores on the Over-all Agreement 
Score (yeasayers) and the 11 with the lowest scores 
(naysayers) were selected for the clinical study. The 
mean OAS score of the extreme yeasayers was 4.3 
(range 4.2 to 4.5); the naysayers’ mean was 3.6 (range 
3.1 to 3.7) 

Procedure. The yeasayers and naysayers were divided 
into two groups which were interviewed by one or 
another of the two Es. The OAS score of every S was 
carefully concealed from the Es. The interviews were 
oriented around a 55-item sentence completion test‘ 
which had been designed to test the major hypotheses 
outlined above. Several sentence “stems” in each 


‘The Sentence Completion Test used in this study 
has been deposited with the American Documentation 
Institute. Order Document No. 6134 from ADI 
Auxiliary Publications Project, Photoduplication Serv- 
ice, Library of Congress, Washington, D. C., re- 
mitting in advance $1.25 for microfilm or $1.25 for 
photocopies. Make checks payable to Chief, Photo- 
duplication Service, Library of Congress. 


hypothesis area were constructed to elicit responses 
that would differentiate yeasayers and naysayers along 
the lines of our theoretical formulations. 

At the beginning of each individual interview, each 
S took the sentence completion test. The Z then dis- 
cussed the meaning of the S’s responses with him and 
tried to obtain information concerning the major areas 
of the authors’ hypotheses. These interviews lasted 
from 2 to 4 hr. The Ss were free to amplify, illustrate, 
or qualify their responses, and were frequently en- 
couraged to digress when interesting themes appeared. 
The Es took extensive notes on the amplification of 
each response and also wrote short clinical evaluations 
of the Ss in the different personality areas under study. 

After all interviews were completed, both Zs made 
independent ratings of sentence completion responses. 
Each E made 4 ratings of each of the 55 responses of 
the 21 Ss on the basis of the following criteria: 

1. Verbal content of response: Using only the 
written response, each item was rated on a 5-point 
scale (+2, +1, 0, —1, —2) which indicated the 
degree that the response corresponded to a “theoreti- 
cal” yeasaying (4-) or naysaying (—) orientation. 

2. Degree of secondary process: On the basis of 
the written response alone, Es used the 5-point 
scale to rate the degree that each response was 
impulsive and immediate (+) showing a direct and 
uncensored reaction to the item; as contrasted with 
reflective responses (—) showing hesitation, qualifica- 
tion, rationalization, or other evidences of secondary 
processes. 

3. Degree of conflict: Using both written responses 
and additional elaboration from the focused in- 
terview, each item was rated on a scale from 0 to 
+2 for the degree of conflict, ambivalence, and 
disparity between values and behavior. 

4. Clinical assessment: Taking into account the 
elaboration and clinical material concerning each 
item, a clinical rating was made using the 5-point 
scale that assessed the degree of correspondence to 
a “typical” yeasaying or naysaying response. 
A number of precautions were taken to reduce the 

possibility of interviewer bias in these ratings. After 
discussing the theoretical basis of the rating scheme, 
the Es did not discuss their ratings with each other. 
All identification was removed from the data. Es’ 
ratings for each item were made separately; that is, all 
responses to an item were rated for all 21 Ss before 
proceeding to another item. Thus, the Zs did not know 
whom they were rating or what ratings the S’s other 
responses had been given. 

As a first step in the analysis of the data, the Spear- 
man-Brown reliability of the ratings was studied. 
Table 11 presents the reliability coefficients for the sum 
of Es’ clinical ratings for the variables in each of the 
major hypothesis areas. 

For the sums of subareas and for individual items, 
reliability coefficients were somewhat lower; the average 
for subareas was +.69, for individual items +.61. 
On the whole interrater reliability seemed very satis- 
factory; therefore, ratings of the two Es were pooled 
to provide the basic data for the authors’ subse- 
quent analysis. 
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TABLE 11 
RELIABILITY COEFFICIENTS FOR INTERVIEW 
RATINGS # 


Verbal Con Clinical 
Major Hypothesis Area _ ee et 
. (N = 21) 

A—External-Internal! Orienta- .57 .68 

tion 
B—Id-ego-superego Integration .80 86 
C—Impulse Control .95 .89 
D—Competition and Distrust 47 .60 
E—Intellectual Outlook .56 83 
F—Secondary Process 

(sum for all items) 76 os 
G—Conflict Manifestations 

sum for all items) _ 65 


Results of the Clinical Analysis 

The main analysis of the data was a series of 
t tests on the difference of means between the 
groups of extreme yeasayers and naysayers. 
The tests were performed for each of the major 
hypothesis areas, and for the subareas that 
were stated in the previous section. In Table 
12, the results of the ¢ tests are given. Each 
hypothesis area is stated in a yeasaying form, 
so that a positive ¢ value indicates that 
extreme yeasayers have a higher mean than 
extreme naysayers. After each hypothesis, we 
have indicated in parentheses the number of 
sentence completion items included by design 
in testing that particular hypothesis. The 
results for ratings using only the verbal con- 
tent of the sentence completions, and for 
ratings that include clinical assessment are 
tabulated separately. 

In the five major areas of personality—(A) 
External-Internal Orientation, (B) Id-ego- 
superego Integration, (C) Impulse Control, 
(D) Competition and Distrust, and (E) In- 
tellectual Outlook—for which we had formu- 
lated systematic hypotheses, all results for 
the clinical assessment ratings were significant 
at the .01 level or greater. The analysis of the 
verbal content ratings shows the same pattern 
at somewhat lower levels of significance. For 
the subarea hypotheses, 9 out of 15 tests on 
the clinical ratings were significant at the .05 
level or greater, with particularly high ¢ 
values for Anal Expulsive Expression. These 
results indicate that ratings that take into 
account the additional material from the inter- 
view (Clinical ratings) differentiate the 


extreme groups better than the ratings based 
only on the written answers to the sentence 
completion items (Verbal Content ratings). 
The hypothesis that yeasayers would show 
less evidence of Secondary Processes than 
naysayers was supported at only the .10 level 
of significance (¢ = +1.62). However, addi- 
tional evidence from the data supports the 
hypothesis: of the 55 items involved in these 
ratings, 45 are in the predicted direction, giving 
a highly significant chi square of 21.9. 
Despite the positive nature of most results, 
some hypotheses were not confirmed. The 
supposition that Conflict Manifestations would 
be greater for yeasayers was not at all sup- 
ported (¢ = —0.65). The expected Conven- 
tional Identification of yeasayers was not 
indicated by either the Verbal Content ratings 
(¢ = +040) nor by the Clinical ratings 
(¢ = +0.94). In the same way, the 
hypothesized Externalization of Problems, 
Weak Superego, and Punitive Physical Sociali- 
zation, as characteristics of yeasayers, were not 
significantly supported by the data. 


Discussion of Clinical Findings 


The clinical study clearly supports the 
main lines of the hypothesized personality 
syndrome behind the agreeing response 
tendency. The analysis of interview ratings 
showed that yeasayers and naysayers were 
significantly differentiated in the areas of 
External-Internal Orientation, Id-ego-superego 
Integration, Impulse Control, Competition 
and Distrust, Intellectual Outlook, and 
Secondary Processes. These personality differ- 
ences appear in many diverse areas of func- 
tioning. The generality of the manifestations 
supports our contention that response set is 
based on an underlying dynamic syndrome. 
The nature of the syndrome makes it possible 
to explain a marked response set to the 
“stimuli” of questionnaire items as a func- 
tional consequence of inner forces in the re- 
sponding individual. Questionnaire items are 
another kind of stimuli demanding response. 
As such they bring forth typical reactions of 
immediate acceptance from yeasayers and of 
inhibited resistance from naysayers. 

External Orientation. Many facets of the 
difference in External Orientation between 
yeasayers and naysayers were made evident 
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TABLE 12 


Tue, Tests OF DIFFERENCES BETWEEN EXTREME YEASAYERS AND NAYSAYERS 


Hypothesis Area 


4—External Orientation (10) 


A-1 Conventional Identification (2) 
A-2 Anti-introception (2) 

4-3 Outward Group Orientation (2 
A-—4 Conventional Self-concept (2) 
A-5 Superficial Values and Ideals (2) 


B—Lack of Id-ego-superego Integration (10 
B-1 Situational Determinism of Behavior (2 


B-2 Externalization of Problems (2) 
B-3 Low Ego Strength (2 
B-4 Weak Superego (2 

) 


B-5 Rapid Changes in Emotions (2 
C—Impulse Expression (25 
1 Overt Expression of Hostility | 
2 Overt Expression of Affection ( 
Anal Expulsive Expression (5) 
Punitive Physical Socialization (5) 
Intense “Dionysian” Expressiveness (5 


wu 


Cc 
C 
C-3 
C-4 
C-5 


D—Acceptance of Competition and Distrust (5 


E—Lack of Intellectual Outlook (5 


F—Low Secondary Processes (55) 


verbal content alone 


G—Conflict Manifestations (55 
clinical rating alone 


* Pooled / tests were 
naysayers f 
> 


** The .01 level of significance is 2.43 


by observation of their general manner and 
behavior during the interview situation. It 
was apparent from our clinical reports that 
yeasayers were much more extraverted in 
their social orientations. They were usually 
more voluble, spoke more easily about them- 
selves (though often with little depth), took 
suggestions readily, reacted in more obvious 
and overt ways to their moods, appeared less 
complex psychologically, tended to wander 
from the subject at hand, and made them- 
selves thoroughly at ease with the interviewer. 
Naysayers, on the other hand, were char- 
acteristically more introverted. They were 
often shy and always reserved about them- 
selves. Throughout the interview, they re- 
quired more questioning and prodding; they 
felt easily criticized, weighed and considered 
each answer, and tended to express themselves 
in indirect and complicated ways—perhaps a 


Verbal Content 


Ratings Clinical Ratings 
+1.89* +3.89** 
+-0.40 +0.94 
+1.86* +2.33* 
+1.67 +4.21°* 
—0.10 +1.96* 
+3.19** +2.94** 
+1.48 +2.52** 
+1.10 +2.27* 
—Q0.14 +1.28 
+1.77* +1.55 
—0.19 +0.61 
+1.31 +1.70 
+2.35* +3.03** 
+2.36* +3.27** 
+1.74* +2.27* 
+2.69** +3.28** 
+0.57 +1.09 
+1.02 +2.47** 
-+-3.01** +3.30** 
+1.74* +3.22** 
+-1.62 — 

- —0.65 


used for testing the significance of the differences in means between ratings for (10) yeasayers and (11) 


For 19 df the .05 level of significance for a one-tailed test is 1.73 


consequence of the fact that for many of them 
introspection and self-analysis seemed virtually 
a full time occupation. By their introverted 
behavior and general lack of social spontaneity, 
the naysayers tended to create a task oriented 
atmosphere and usually initiated and main- 
tained an intellectual relationship with the 
interviewers. 

Impulse Control. Our hypotheses concern- 
ing Impulse Control, which had been fully 
supported by the empirical analysis of Clinical 
ratings, were verified more indirectly by our 
observations during the interview. In dis- 
cussing impulsivity and the expression of 
feeling, yeasayers and naysayers were es- 
pecially different. Yeasayers talked with 
enthusiasm about their impulsivity and their 
love of emotionally exciting and novel situa- 
tions. Sexuality and aggressive feelings were 
discussed quite openly, often with exhibition- 
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istic exaggeration. Yeasayers sometimes ex- 
pressed the fear that they were excessively 
uncontrolled and attributed their interpersonal 
difficulties to impulsiveness, hastiness, or 
unthinking rashness. Naysayers, on the other 
hand, were often defensive and embarrassed 
about emotional impulsivity and their manner 
of expressing inner feelings. They were fre- 
quently worried about their shyness and lack 
of social poise. They seemed concerned about 
their excessive emotional constraint and lack 
of spontaneity. Yet beneath this surface of 
emotional constriction, there appeared to be a 
depth of strong feelings which could only come 
out in close personal relationships. 
Intellectual Orientatien. A striking difference 
in the Intellectual Orientation of naysayers 
and yeasayers was apparent throughout the 
interview. Naysayers had a pervasively ra- 
tional and intellectual approach to whatever 
topic was discussed, while yeasayers were 
typically less conceptual, viewing the world 
in terms of emotional reactions and concrete 
instances. In exploring this area, we asked the 
subjects about their attitude towards ques- 
tionnaires. Yeasayers and naysayers described 
how they approached the task of answering 
Likert scale items in consistently different 
ways. The yeasayer responded to the 
“surface” of the items’ meaning, did not 
reflect on their possible full implications, and 
thus gave impulsive, immediate answers. This 
pattern is consistent with the yeasayer’s prefer- 
ence for blunt, straightforward, and extreme 
opinions, which reflect his emotional feelings 
at the moment. In quite an opposite manner, 
the naysayer analyzes and dissects each item, 
considering it from several points of view and 
attempting to be “logical” in his responses 
to the highly “interpreted” statements of the 
questionnaires. As was noted in our discussion 
of item tone, there seems to be an affinity of 
the yeasayer and naysayer for items phrased 
in his characteristic style of thought. Yea- 
sayers are themselves enthusiastic, colloquial, 
and given to extreme opinions. Naysayers are 
guarded, qualified, and somewhat negative in 
their intellectual orientation. When the phrase- 
ology of a questionnaire item approaches one 
of these patterns, it presents a differential at- 
traction to these two types of individuals. 
These differences in verbal style and intellec- 


tual approach contribute to the opposing 
patterns of the extraverted, fun loving, emo- 
tional yeasayer and the introverted, serious- 
minded, and rational naysayer. 

Personality Integration and Conflict. In the 
area of Id-ego-superego Integration our hy- 
potheses were supported on an over-all basis, 
but some of the subhypotheses were be- 
low normal significance levels, in particular, 
Externalization of Problems and Weak Super- 
ego. A similar error had been made in our 
hypothesis about personality conflict. The 
ratings of Conflict Manifestations had failed 
to show the evidence of ambivalence, denial, 
and dissociation that we had expected to dif- 
ferentiate yeasayers and naysayers. In 
attempting to explain these negative findings, 
we noted that all of these hypotheses had 
been derived from the questionnaire results 
that portrayed yeasayers as anxious, psychas- 
thenic, schizoid, unstable, and generally mal- 
adjusted. Our clinical study indicates that the 
implications of these scale correlations must 
be re-evaluated. From the interview, it was 
apparent that yeasayers were more able and 
wiling to admit their “unacceptable” impulsive 
feelings; whereas naysayers were unusually 
susceptible to denial mechanisms and a general 
inhibition or repression of libidinal and ag- 
gressive desires. In discussing competitive 
situations and aggression, the naysayers were 
often unaware of hostilities which seemed 
obvious to the interviewers. Their “reactive” 
trust had, at times, the characteristics of 
repressed hostility and resentment. On the 
other hand, the yeasayers were in much closer 
contact with their unacceptable impulses and 
had fewer inhibitions about revealing them 
than the naysayers. 

It is thus probable that the yeasayers’ 
higher scores on “maladjustment” 
which had suggested our hypothesis of dis- 
sociation and conflict, are chiefly attributable 
to their greater frankness and willingness to 
admit impulsive feelings and ambivalences. 
Clinical results also suggest that the naysayers 
may be presenting a ‘“‘mature”’ picture colored 


scales, 


by strong denial mechanisms. Two of our 
naysaying Ss later in the year developed rather 
severe neurotic disturbances requiring thera- 
peutic treatment; the questionnaire results 
gave woefully inadequate indications of these 
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trends. We would now reformulate our hy- 
potheses about these aspects of dissociative 
tendencies to predict the type, but not the 
degree of conflict or maladjustment. The 
agreeing and disagreeing dimension seems 
indicative only of the central area where 
psychogenic deviation may occur: yeasayers’ 
problems center around inadequate controls, 
while naysayers’ are focused on excessive in- 
hibition. To the extent that yeasayers lack 
personality integration, they deviate in the 
direction of “id domination” associated with 
an externalized superego and ineffectual con- 
trols. On the other hand, the naysayer who 
becomes maladjusted should tend towards 
‘superego domination,” frequently main- 
tained by emotional dissociation and repressive 
controls. 

Conventionality. In the area of superego 
strength and conventional values, another 
set of hypotheses was not fully supported. 
Besides the Weak Superego hypothesis already 
mentioned, two others, Conventional Identi- 
fication and Punitive Physical Socialization, 
indicated an error in prediction, based in part 
on the premise of dissociation discussed above, 
but also on the assumption of greater ‘“‘c 


‘ 


con- 
ventionality” for the yeasayer. This latter 
assumption seems to be in error. Contrary to 
prediction, many naysayers had conventional 
religious ideologies that were strongly inter- 
nalized ; naysayers were also more conventional 
in many areas of social behavior and attitude 
than the more “liberated” and exhibitionistic 
yeasayers. Interview material also indicated 
that the type of identification with the 
socializer was more relevant to the response 
set dimension than the particular method of 
socialization. The parents remain external 
figures to the yeasayer; whereas the naysayer 
identifies more completely with them. Yea- 
sayers seemed freer to express their impulses 
with little concern for interpersonal conse- 
quences. Yet these same yeasayers had 
“strong” but rigid superegos which had little 
influence on their everyday behavior, except 
in the sense that this “control” had to be 
opposed and subverted. They experienced 
guilt as something external to their central 
selves. Naysayers, on the other hand, had 
superegos firmly integrated into their ego 
ideals. In our original formulations, we had 


erred in stressing superego strength, instead of 
the degree of value internalization. It is the 
degree of internalization and not the degree of 
conventionality that determines the differen- 
tiating influence of values on the behavior of 
yeasayers and naysayers. 

Awareness of Response Set. Toward the end 
of each interview, the topic of response set was 
brought up explicitly. The Ss were reminded 
of the questionnaires they had taken earlier in 
the year, and were asked whether they had, in 
general, agreed or disagreed more with the 
items. More than three-fourths of the Ss 
characterized their response set accurately; 
one was inaccurate and the rest had no recol- 
lection. Most Ss reported some awareness of 
response bias as they filled out questionnaires. 
Several commented that they had consciously 
compensated for their tendency to mark pre- 
dominantly in one direction or the other, but 
as is evident from the basis of their selection 
as extreme yeasayers and naysayers, in all 
cases the original response set had prevailed. 


GENERAL INTERPRETATION AND CONCLUSION 
Statistical Implications of Response Set 

Our research originated with a realization 
that agreeing response set was of far-reaching 
importance in the realm of objective ques- 
tionnaire tests. The usual assumption of 
psychological test construction is that agree- 
ment or disagreement with a scale of items 
measures primarily the variable for which the 
test was designed. But many of the statistical 
techniques used in scale development result 
in a contamination of the final scale with an 
unwanted component of response set. No 
matter how a scale is developed, it is virtually 
impossible to eliminate the effect of agreement 
bias in an S’s answer pattern. As a consequence 
questionnaire items seem almost “possessed” 
by a tendency to correlate positively with one 
another regardless of content. The S is, in part, 
responding to the “meaning” of the item, but 
also, in part, to the answer scale associated 
with the item. Although this answer scale is a 
necessary part of the measurement technique,® 

5 Some measurement techniques such as Thurstone’s 
Paired-Comparison Method, Guttman Scale checking, 
forced choice alternative responses may avoid some 
of the intrinsic effects of the agreement tendency. But 
even here the response bias may appear in subtle ways, 
such as a tendency to check every item or the most 
extreme item. 
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it also evokes the agreement response tendency 
that subsequently becomes inextricably inter- 
woven into the item score. It is quite possible 
that many factor analyses of item domains 
have revealed a first centroid factor which 
might best be explained as a manifestation of 
response set. 

This same phenomenon is evident with 
multi-item Positive scales which tead to 
have suspiciously high positive intercorrela- 
tions with one another. The diagnosis is again 
“response set,” operating all the more power- 
fully when several items are summed in the 
same scoring direction to measure an intended 
variable. These scale scores also measure in 
varying and unknown degrees a component of 
the agreement response tendency. It is this 
“common factor”’ of response bias which often 
causes the “operational” measurements of 
variables to correlate positively, regardless of 
the degree of psychological “opposition” of 
the measured concepts. The sum of any group 
of Likert scale items scored positively will 
always be a somewhat valid, though perhaps 
unintended, scale for response set. These 
Positive scales will show the effects of this 
additional component, partly by being prone 
to positive interrelationships with other Posi- 
tive scales, and partly by reflecting in their 
pattern of highest correlations some of the 
psychological aspects of the agreement per- 
sonality dimension. Since all intercorrelations 
with scales of this type are pushed in a positive 
direction, correlational results must be inter- 
preted with this response variable in mind. 
It also follows that significant negative corre- 
lations will appear only when there are ex- 
tremely strong factors operating against the 
positive influence of response set. An indication 
of this bias was apparent in the questionnaire 
battery where nearly 90% of all items corre- 
lated positively with our measure of agreement 
tendency. 

Balanced scales. In our attempt to study the 
response tendency in the questionnaire realm, 
we remained acutely aware of these difficul- 
ties. In trying to identify the properties of the 
response variable, we had occasion to make 
use of ‘‘balanced”’ scales, containing an equal 
number of items scored positively and nega- 
tively. This type of scale is a useful method of 
cancelling out at its source the contaminating 


influence of agreement response set on sum 
scores, and thus provides at least a statistical 
solution to some of the difficulties raised by 
this biasing tendency. 


Response Set as a Personality Variable 


Our primary interest in this study has 
been in the fundamental psychological deter- 
minants of agreeing response set within the 
responders themselves. Basing our approach 
on the conviction that response set is a reflec- 
tion of personality dynamics, we set out to 
explore its correlates in the realm of objective 
personality tests. An analysis of these rela- 
tionships suggested that the best single char- 
acterization of the traits associated with 
agreeing response set is Stimulus Acceptance 
vs. Stimulus Rejection. The yeasayer accepts 
stimuli both by admitting them to conscious- 
ness without censorship, alteration, or assimila- 
tion and by agreeing with, acting out, and 
otherwise yielding to the pressures of stimuli 
exerted on him. The naysayer tends to reject 
these same stimuli, which may be either in- 
ternal (impulses, needs, whims, sudden moods) 
or external (suggestions, provocations, coer- 
cions, excitement, novelty). The most obvious 
manifestation of the general acceptance of 
stimuli takes the form of “agreeing” with, 
marking Yes, or checking True to a wide 
variety of otherwise unrelated items and 
statements. 

Our interview study brought to light many 
facets of inner dynamics which permit us to 
go beyond a simple trait explanation of the 
agreeing and disagreeing response tendencies. 
In terms of personality structure, the most 
salient difference between yeasayers and nay- 
sayers is in their mode of ego functioning. Our 
results suggest that yeasayers have relatively 
“passive” (releasing) egos, as contrasted with 
the more “active” (controlling) egos of the 
naysayers. This difference appeared most 
clearly in the contrasting strength of secondary 
processes in their personality structure. For 
naysayers, the secondary processes of delaying, 
censoring, rationalizing, sublimating, and 
integrating were far more prominent than in 
the less active egos of yeasayers who portrayed 
themselves as unreflective, quick to act, 
easily influenced, and unable to tolerate delays 
in gratification. 
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These differences in ego functioning might 
best be characterized in terms of the psycho- 
logical inertia of secondary processes. High 
inertia characterizes the active egos of nay- 
sayers who have a high resistance to movement 
and change, a slowness in their responses, a 
strong tendency toward equilibrium in their 
inner world, and an ego domination that 
frequently appears in excessive control and 
inhibition. On the other hand, yeasayers are 
low in psychological inertia, reacting quickly 
and overtly to emotional pressures without 
the “dampening” effect of inhibiting and 
delaying mechanisms. 


Theory of Developmental Dynamics of Re- 
sponse Set 


To achieve a deeper understanding of this 
pattern of personality factors, it seems neces- 
sary to formulate the developmental dynamics 
of agreeing response set. The data in both 
parts of the study suggest that the contrasting 
personality patterns of yeasayers and nay- 
sayers have their developmental origins in 
the differential reactions to what are often 
called anal socialization problems. Such an 
interpretation, while not directly tested by 
this analysis, nonetheless helps to integrate 
the results of our study. 

Psychoanalytic theory (Freud, 1908, 1916) 
emphasizes that the resolution of early prob- 
lems centering around the control of elimina- 
tion underlies later patterns of impulse control. 
The child’s parents exert pressure on him to 
control his own evacuation (and other anti- 
social, value-violating impulses) with the hope 
that he will internalize their coramands and 
enable them to relax their vigilance. One type 
of reaction to such parental pressure is not 
to internalize the parental edicts, but to retain 
the need for external control and guidance, 
especially in the area of impulse control. This 
“solution” involves the child in a continual 
conflict between his desires and those of the 
restraining authorities, but at the same time 
allows him (both as child and adult) the 
pleasure of expressing impulses which are 
supposed to be suppressed. The pleasure is 
often a double one; partly a narcissistic grati- 
fication and partly the attention-getting 
satisfaction of annoying external authorities. 
This reaction could be called the anal ex- 


pressive or expulsive resolution of early 
socialization pressures, and is associated with 
ineffectual control of impulses. The second 
type of resolution involves a complete inter- 
nalization of parental control functions, 
perhaps related to an unusually strong identi- 
fication with the parents. In addition to the 
superego development stemming from this 
internalization, the ego of the child gains 
autonomy and power. In a sense, he becomes 
his own parent by transferring to his own per- 
sonality the previously external power of the 
parents to control and reward, thereby gaining 
self-control and active mastery of his in- 
stinctual demands (Erikson, 1950, pp. 76-81). 
He thus gains his freedom from external 
restraint, but at the price of severe internal 
control. When acting like the controlling 
parent, the child is rewarded from within for 
suppressing his own “childish” impulses. With 
these developments comes the desire and 
ability to delay gratification, the main com- 
ponent of active ego functioning. Since the 
parent-like ego of the child now has the ability 
to “retain” or suppress impulse expression, 
this resolution of control problems may be 
termed anal retentive or anal suppressive. 
Our results suggest that yeasaying and nay- 
saying are closely related to these two opposite 
reactions to parental pressure for control of 
antisocial behavior. Yeasayers show many 
manifestations of the anal “expressive” type 
of resolution that involves the lack of inter- 
nalization of impulse control. Their original 
uncontrolled expressiveness is only slightly 
transmuted into a conscious preoccupation 
with “anally tinged” gratifications and ex- 
pressive exhibitionism. Not having strong inner 
controls, they express their impulsive desires 
with little or no guilt. Evidences of these anal 
concerns among yeasayers are especially 
prominent in our questionnaire studies. The 
Anal Preoccupation factorial scale, which 
correlated +.75 with the OAS, contains items 
expressed in implicitly anal metaphors, such 
as: “really splurging on something,” “sitting 
surrounded by your possessions,” “‘sitting in 
the seats of the mighty,” “smelling offensive 
odors,” “saying dirty things,” “spending 
money freely,” and “exploring dirty slums and 
alleys.” The Anal Submission facterial scale, 
which correlated -+.67 with the OAS, contains 
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a set of items expressing a general need for 
external control: ‘““The college student needs 
advice and guidance from someone wiser than 
himself,” “I often wish that someone could 
make my decisions for me,’’ etc. In these items, 
we see the anal pattern in all of its mani- 
festations; on the one hand, the craving for 
excitement and the continual search for and 
indulgence in the pleasures of impulse expres- 
sion that remains unrestrained by the ego, and 
on the other hand, the need for external control 
and guidance, closely coupled with resentment 
of this same control. 

The naysayers’ retentive or suppressive 
resolution of anal problems shows many mani- 
festations of their internalization of parental 
demands for control and of their subsequent 
aversion for “anally expressive” impulses. 
On questionnaire items of the “anal” variety 
discussed above, the naysayer consistently 
disagrees. He is emphatically not interested in 
“splurging,” “messing around,” or “blowing 
up’”’—he has learned his early lessons well, 
and these impulses have ceased consciously 
to trouble him. His identification (sometimes 
overidentification) with the antianal concerns 
of his parents is also shown by his rigidly main- 
tained “suppressive” control of emotional 
responsiveness. 

The focused clinical study considerably ex- 
panded this formulation of the role of anality 
in the personality dynamics of naysayers. Their 
identification with the socializing pressures of 
their parents appears not only in their dislike 
of anal concerns but also in their avoidance of 
“childish” expressions of impulses. Having 
taken upon themselves the full burden of what 
was once external control, they no longer feel 
any need for external advice, guidance, or 
counsel. The necessity of friction and ag- 
gressiveness in competitive situations is 
strongly denied, and replaced consciously by 
reactive trust and tolerance for others. In 
interviews, naysayers are extremely reluctant 
to admit to any hostility toward their close 
friends, and overt aggressiveness is avoided 
at all costs. They avoid all other situations 
that provoke emotional expressiveness or an 
abandonment of ego controls. Much energy is 
expended in the effort to maintain a calm 
equilibrium. Lacking the youthful exuber- 
ance or carefree spontaneity of the yeasayer, 


the naysayer stresses sublimated satisfactions, 
stability, and self-control, damping down po- 
tentially disruptive emotions and avoiding 
situations which might disturb his balance. 
With little visible affect and great surface 
calm, he turns his attention toward introspec- 
tion and his complex inner life. But, in extreme 
form, this overcontrolled personality pattern 
suggests that something may be amiss beneath 
the defiantly calm surface. Avenues toward 
libidinal gratification are blocked and the 
transmutation of aggression into reactive trust 
and tolerance may, in some cases, cause ex- 
tensive parataxis and constriction of inter- 
personal relationships. 

In conclusion, we have tried to integrate the 
findings of our study ef response set by pre- 
senting a theory of the developmental dy- 
namics of “yeasaying” and “‘naysaying.”’ This 
theory is speculative, but the consistent evi- 
dence from both the questionnaire data and 
clinical results supports our contention that 
agreement response tendency is ultimately re- 
lated to a meaningful personality syndrome 
which is consistent with psychoanalytic ac- 
counts of the resolution of problems of impulse 
control. 


SUMMARY 

This study began with a realization of the 
importance of the agreeing response set in the 
field of personality measurements. It was 
evident that an agreeing-disagreeing tendency 
was permeating the answer patterns of our 
subjects to a series of questionnaire items. Our 
first step was to develop an appropriate 
measure of this response set which would be 
essentially independent of any specific content. 
By increasing the number and heterogeneity 
of items, and by “cancelling out” positive and 
negative statements of each variable, we ob- 
tained a 360-item Over-all Agreement Score 
(OAS) that met this criterion. 

This measure of response bias showed con- 
siderable internal reliability, consistency over 
time, and generality over tests. The agreeing 
tendency was highly related to “‘true-saying” 
and “yes-saying’” on other standard tests. 
We then investigated the correlates of the 
OAS to the variables of several objective test 
domains. A cluster of scales characterized 
the positive end of the agreeing tendency: 
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Impulsivity, Dependency, Anxiety, Mania, 
Anal Preoccupation, and Anal Resentment. 
The opposite end, the “disagreeing tend- 
ency,” was defined by such scales as: Ego 
strength, Stability, Responsibility, Tolerance, 
and Impulse Control. When a factor analysis 
of individual items was performed, a major 
dimension of Stimulus Acceptance vs. Stimu- 
lus Rejection emerged as central to the agree- 
ing-disagreeing response set. Personality test 
results also indicated pervasive differences 
in ego functioning, particularly as regards the 
high vs. low psychological inertia of secondary 
processes. 

In the course of our study with objective 
questionnaires, we explored the technique of 
constructing Balanced scales and noted their 
effectiveness in eliminating the influence of 
the agreement response set on scale scores. 
By using an equal number of positively and 
negatively scored items, the agreeing response 
set can be balanced out at its source. In this 
manner, we could demonstrate the lack of any 
strong relationship of authoritarianism to 
agreement response set. The independence 
of response set and authoritarianism was sup- 
ported by a factor analytic study where these 
variables emerged as orthogonal dimensions. 
In our further research with questionnaire 
items, we noted that “item tone” was a sig- 
nificant influence in attracting or repelling 
agreement. By using this knowledge, we con- 
structed a “‘Naysaying”’ Low F scale that 
showed the expected characteristics of a non- 
authoritarian measure. 

Having discovered a fairly definite psycho- 
logical pattern of objective scales that were 
empirically related to the Over-all Agreement 
Score, we proceeded to a clinical analysis of 
extreme Ss at opposite ends of this agreeing 
response tendency. A series of hypotheses were 
formulated about the personality differences 
between “yeasayers” and “naysayers.” Using 
a clinical interview structured around a sen- 
tence completion test, we confirmed the 
main aspects of our personality formulation of 
the agreement response tendency. Yeasayers 
were shown to be individuals with weak ego 
controls, who accept impulses without reserva- 
tion, and who “agree” and easily respond to 
stimuli exerted on them. The naysayer inhibits 
and suppresses his impulses, in many ways 


rejecting all emotional stimuli impinging 
on him. 

The clinical study suggested a possible de- 
velopmental explanation of the deeper deter- 
minants of the yeasayer’s and naysayer’s 
personality structure. During their early 
socialization period, they tend towards op- 
posite resolutions of the “anal” problems 
around control of impulses. The passive and 
receptive egos of yeasayers seem to be an 
outgrowth of their lack of internalization of 
parental control. While maintaining the need 
for external control, yeasayers can freely 
indulge in impulse gratification. On the other 
extreme, the active, guiding egos of the nay- 
sayers apparently result from an internaliza- 
tion of the controlling functions of the parental 
figures they had strongly identified with 
during childhood. The egos of naysayers take 
over the controlling functions of the parents, 
and the suppression of impulses is subsequently 
self-maintained and self-rewarded. 

In conclusion, we feel that this integrated 
study, though limited by the size and nature of 
our population, has demonstrated both the 
far-reaching importance of response set in the 
area of psychological tests and the major 
proposition that the agreeing response tend- 
ency is based on a central‘ personality syn- 
drome. 
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ACHIEVEMENT AND NURTURANCE IN TRIADS VARYING IN 
POWER DISTRIBUTION! 


MARILYN V. CHANEY anp W. EDGAR VINACKE 


University of Hawaii 


HE following study is part of a program 

of research concerned with intragroup 

relations (coalition-formation or non- 
formation) that develop in a competitive situa- 
tion where winning depends upon establishing 
an appropriate bargain with another member 
of the group. The point of departure was pro- 
vided by a series of hypotheses suggested by 
Caplow (1956), who based his analysis of power 
situations upon work by Georg Simmel (Wolff, 
1950; Mills; 1954, 1954) and others. Simmel 
held that any group of three tends to become 
a coalition of two against one, but that these 
alliances may shift from pair to pair, depend- 
ing upon many factors, including the immedi- 
ate issue and the personality characteristics of 
members. 

Caplow proposed that coalition formation 
should be related to the internal power differ- 
entiation of the triad. A simplified statement 
of his propositions for six power patterns is as 
follows: 

Type I (1-1-1). When all are equal, all possi- 
ble pairs are equally likely to form a coalition. 

Type II (3-2-2). When one member is 
stronger than the other two (who are equal), 
but in combination exceed the first, the two 
weaker members will usually ally against the 
stronger. 

Type III (1-2-2). When one is weaker than 
the other two (who are equal), but the weaker 
in combination with either of the others ex- 
ceeds the third, this pairing will occur more 
often than that of the two stronger members. 

Type IV (3-1-1). When one is stronger than 
the other two combined (who are equal) no 
coalition will ordinarily occur. 

Type V (4-3-2). When all are different in 


strength, but any pair exceeds the third person, 


! This article is based on the senior author’s masters’ 
thesis deposited in the University of Hawaii Gregg M. 
Sinclair Library. The analysis hereinafter presented, 
however, differs from the original, which used games as 
units rather than pairs and individuals. 
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then the two weaker will tend to ally against 
the strongest. 

Type VI (4-2-1). When all are different in 
strength, but one is stronger than the other two 
combined, no coalition will very often occur. 

Vinacke and Arkoff (1957) tested Caplow’s 
hypotheses in a simple game situation (to be 
here described). Their results sustained all the 
hypotheses. The general strategy displayed 
was interpreted as depending upon the percep- 
tion of initial strength, in contrast to an alterna- 
tive strategy of rational analysis of outcome, 
derived from the theory of games (see 
Vinacke & Arkoff, 1957). The former strategy 
has characterized both male and female, as 
well as mixed-sex, triads, playing under several 
incentive conditions. 

The question now arises concerning the effect 
of personality variables upon play when power 
pattern varies. A number of studies, employing 
three-person groups, are suggestive in this 
regard. Mills (1954) found that differences in 
social status have little effect on coalition 
formation, but that need for self-enhancement 
is more important in high- than in low-status 
members. Chapman and Campbell (1957), 
varying the authoritarianism of members, 
found no significant superiority for any combi- 
nation. DeCharms (1957), in contrast, showed 
that threat-oriented affiliation results in high 
productivity in competitive tasks, but low 
productivity in cooperative tasks. In a study 
by Hoffman, Festinger, and Lawrence (1954), 
the importance of the task and the degree of 
comparability (perception of others as peers or 
non-peers) both proved to affect strategy in a 
game situation. Thus, weaker members more 
often allied against a person with a large initial 
advantage when the task was of little impor- 
tance and the strong member was thought to 
be superior to them. 

In order to manipulate motivational condi- 
tions, we selected two of Murray’s “manifest” 
needs (1938) most obviously likely to influence 
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play in a competitive game. “Achievement” 
has been described as desire to overcome 
obstacles, exercise power, and to strive to do 
something difficult as well and as quickly as 
possible. It is said to be characterized in part 
by enjoyment of competition and determina- 
tion to win. “Nurturance,” in contrast, is 
defined as need to nourish, aid, or protect the 
helpless, to express sympathy, and to be asso- 
ciated with traits of generosity, sympathy, and 
tolerance towards others. 

The problem of this experiment is to com- 
pare the behavior of subjects (Ss) high in 
achievement with Ss high in nurturance in a 
competitive three-person game, with the third 
member serving as a control. It was hypothe- 
sized that the former would display a pattern 
of behavior reflecting desire and effort to win, 
the latter a more accepting and less competi- 
tive style of play. 


METHOD 


Subjects 


Twenty male triads were recruited from the intro- 
ductory psychology course. All students were adminis- 
tered the Edwards Personal Preference Schedule 
(Edwards, 1954), a test that purports to measure 
motivational tendencies in accordance with Murray’s 
formulations. Among the 15 variables assessed by the 
test are achievement and nurturance (the correlation 
between them, as reported by Edwards, is r = —.30). 
Ss high in achievement were selected from those falling 
at or above the 74th percentile on that scale and at or 
below the 24th percentile on nurturance. For nurtur- 
ance, Ss were chosen from those who fell at or above the 
70th percentile on that scale and at or below the 24th 
percentile on achievement. Control Ss were taken from 
those between the 50th and 58th percentiles on achieve- 
ment and the 37th to 63rd percentiles on nurturance. 
Each triad was composed, in this fashion, of an S high 
in achievement, an S high in nurturance, and a control. 
The characteristics of the triads may be summarized, as 
follows: 


Achievement Score Nurturance Score 


AL SD M SD 
Ach Ss 20.75 1.99 9.25 57 
Con Ss 15.25 1.52 13.25 1.65 
Nur Ss 9.70 2.36 21.15 


Game Situation 


A simple game was devised by pasting numbers con- 
secutively on the outside spaces of an ordinary parchisi 
board. To the colored counteres (red, green, blue, and 
yellow) were affixed on one side numbers corresponding 
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to the requisite weights that were to determine the 
strength of the players in the game. The power types 
used in this experiment were the Types I, II, V, and 
VI, defined above, and referred to below as Type I 
(1-1-1), Type II (3-2-2), Type III (4-3-2), and Type IV 
(4-2-1). Each triad played through 12 games in three 
series of each type, with position of the type in each 
series randomized. The three required counters were 
presented to Ss, with weights concealed, in a hopper, 
from which they drew in turn (order rotated among 
Ss). This technique did not guarantee an equal occur- 
rence of each weight for each S, but did enhance 
interest in the game, a condition deemed essential. An 
arbitrary “prize” of 100 points could be won, all or in 
part, in every game. 


Procedure 


After the triad was assembled (without mention of 
the motive test previously administered in the lecture 
class), EZ explained that they would play a game and 
presented each member with a copy of the following 
instructions: 

This game is a contest among three players. Before 
the game begins, each player draws a marker out of 
the hopper. This marker indicates the strength that 
the player is to have during the game. When the 
referee throws the die, each player receives one space 
for each spot shown, and the player moves forward 
the number of spaces to which his strength entitles 
him. For example, if there is a “2” on the marker 
and the die zhows “3,” the player will move forward 
six spaces. 

The game is won by the first player to encircle the 
board. This player is awarded 100 points except that 
in the case of ties, the winning players will divide 
the prize equally among themselves. 

At any time during the game, any player, in return 
for a promise of a specified portion of the prize, may 
form an alliance with any other player. In this case, 
the allies immediately proceed to the position equal 
to their combined acquired spaces, and in the future 
throws they move together according to the total of 
their combined strengths. 

Any player may concede defeat when his position 
seems hopeless. 

After the instructions had been clarified, as necessary, 
the first draw was conducted. Thereafter players were 
left to their own devices, except to assure that they 
played according to the rules. E remained an impartial 
referee throughout the play. He kept a record of weights 
drawn, progress of play, alliances offered and fo:med, 
and other relevant information. 

All triads soon discovered that actual play is not 
necessary once coalitions are agreed upon, or, in Type 
IV, once the draw is completed. 


RESULTS 


The data concerning coalitions are pre- 
sented in Table 1. Two points of explanation 
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TABLE 1 


Coatrtions AMONG THREE Motive-ConpbiITIONS IN TERMS OF OPPORTUNITY 
(N = 20 Triads) 





Type I Type II (3-2-2) 











eS Type IIT (4-3-2) 
% Opportunity (1-1-1) Strong (32) Weak (22) 

AC | AN®) CN | AC | AN | CN) AC | AN) CN) Strong (43) | Strong (42) | Weak (32) 
51 or more 4} 0| 9] 0] 1] 0| 6| 9| 14] 1] 2 2} 1] 1| 1] 8| s| 9 
33 to 50 g/10/ 7/ 4] 4| 2] o| 3| 2] 2 4d Hid Hod fod Woe foe oe 
1 to 32 s| s| 2] 1 1/0} 0} 0} 0} a) 4/4) 4 | 1) 0} 4] 0 
0 3| 2] 2} 15/13/14] 2] 2| 2/10] 8}11| 8/12/11] 2] 4} 3 
20 | 20 | 20 | 20 | 19 16| 8| 14| 18 14 | 12 | 16 i | a7) 14 | a2) 12 | 13 
M% 36 | 26| 50! 9/14] 5|75| 70| 78| 16| 22] 19|.15| 16] 11 | 68 | 48 | 67 

(Triple Coalitions) (28)  —_— ae (6) ' 








ns of entries: A, C, N refer to achievement, control, and nurturance Ss; each frequency refers to a pair (o 
the dashed line shows the division employed in computing chi square. Triple alliances are prorated among 


Note—Explanatio 
S) in a single triad; 
pairs. 

* AN vs. CN: X? = 3.96, "8 = 1, P <0.5. All other testable comparisons not significant. 


TABLE 2 
PARTICIPATION IN COALITIONS BY THREE MOTIVATION CONDITIONS IN RELATION TO OPPORTUNITY 
(N = 20 Each) 


Tl 
|! 


Type III (4-3-2) 





Type I Type II (3 -2-2) 
% Opportunity (1-1-1) Strong (as 3) Weak (as 2) 
a a . a Strong (as 4) Middle (as 3) | Weak (as 2) 
A* ¢ N A Cc N A . N 
f f f f f f f f f | 
51 or more 4; 15) 10 2 2 2; 11/13) 17 3 4 2} 10 9/10] 10 9 8 
33 to 50 11} 5| 8/ 3] 3} o| 3] S} 1] 4] 2] 1) 1] 4] 1) 4] 4] 3 
1 to 32 5 0 2 1 0 0 0 1 2 1 1 1; 0 0 0 | O 0; 0 
0 o| 0] 0/12] 9] 6] 1] o| of 8| 5] 8} 3] 3] 1] 2] 2] 3 
20 | 20 | 20 | 18} 14 8 | 15 | 19 | 20] 13 | 12 | 12 | 14] 16] 12 | 16 | 12 | 14 
M% 39 | 60 | 54 | 18 | 20 | 25 | 80 | 79 | 85 | 24 41 | 18 | 71 | 67 | 85 | 73 | 76 | ot 





Note.—See Table 1. 
* A vs. C: x? = 12.12, 8 = 1, P > .001; A vs. N: x? = 3.96," = 1, P > .0S. All other testable comparisons not significant. 


are needed. First, it will be noted that triple players (except in Type I), each pairing did 
coalitions also occur, especially in Type I. Al- not necessarily have the same opportunity to 
though these might be omitted in determining occur. For example, the 3-2 combination in 
the incidence of alliances, it facilitated analysis Type II could happen between C and N only 
to prorate them among the participants. in 16 of the 20 groups. Furthermore, within a 
Second, since the method of drawing weights triad, a given player might draw a particular 
did not guarantee equal distribution among the weight one, two, or three times. The data, 
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TABLE 3 , 

INITIATIONS AND RECEPTIONS BY THREE MOTIVE CONDITIONS IN RELATION TO OPPORTUNITY WHEN THE “WEAK”? | 

PosiTion Is DRAWN IN Types II anp III 

| 

Initiations Receptions 
oe a — ae ae 
Offer Coalition Offer Coalition 
> Opportunity A Cc N A Cc N A Cc — ae A Cc a - 4 
Y 2 
51 or more 9 7 5 9 6 3 4 6 4 3 6 7 3. | 
33 to SO 1 2 8 1 3 7 4 6 10 6 4 6 4. 
. 

1 to 32 i ~%’ ¥ 1 2 5 1 4 3 6 4 6 
0 7 9 5 8 9 5 10 4 3 9 5 4 — 
19 20 20 19 20 20 19 20 20 19 20 20 j II 
MY 46 37 42 44 35 33 20 46 47 26 42 45 eg 

x? ” P x? ” P x ” P x ” P 

: gre 
A vs. C 62 1 <.50 1.23 1 < .30 4.51 1 <.05 2.12 1 <.20 an 
A vs. N 2.12 1 <.20 4.78 1 <.05 6.22. 1 > .01 3.29 1 > .05 dis 

C vs. N 48 1 <.50 1.30 1 <.30 44° 1 >.50 16 1 <.70 | 
——— 
* Division for C vs. N | ne 
| col 
; , ~ , , ; ple 
therefore, are cast in terms of the percentage Weight 2 in those games). Each entry in the | to 
of the opportunities in which a particular pair _ table is based upon the occurrence of coalitions = 
could form a coalition. Chi square was used in as percentage of opportunity. “Initiation” | ga 
statistical analysis, although means are also refers to proposing that an alliance be formed; TI 
presented. The peculiarities of the distribu- “reception” means the member to whom the tes 
tions made / tests inappropriate. offer was made. “First offer” signifies the | an 
Two striking points are evident: (a) all pairs opening gambit, and “final coalition’? means ju 
are about equally represented, except in Type _ the alliance that terminated play inthat game. | 6, 
I, where CN is significantly more frequent Cases where only one offer was made and ac-_ | - 
than AN; (8) as in other experiments in this cepted are assigned to both classes of events. — 
series, the alliances are formed in accordance Scrutiny of Table 3 discloses that Ss high in os 
with initial strength, the weak pairings oc- achievement tend to be initiators, with the | 4, 
curring in the great preponderance of instances. other two acting as recipients. | th 
From this standpoint, motivational differences We may quickly dispose of Type IV. As in pe 
among the players have very little effect upon other experiments, coalitions are formed much ois 
play. less often than in other types (here, in 30% of ole 
Much the same result appears when partici- the games, all but one of the alliances in- de 
pation of each motivational category is deter- volving the all-powerful players). It is worth ac 
mined, as shown in Table 2. Except in Type I, noting that in line with our hypotheses, Ss ? nt 

where Ss high in achievement are involved in high in achievement permitted only 19% of 

alliances significantly less often than either of these outcomes when they held the strong in 
the others, there are no significant differences. weight, but that the controls and Ss high in hi 
Analysis of the characteristics of play, how- nurturance allowed 41 % each. he 
ever, reveals a distinction between Ss high in The question of deals actually consummated pc 
achievement and high in nurturance. Table 3 can be combined with the general matter of en 
summarizes performance for the “weak” winning. Analysis showed no significant dis- en 
players in Types II and III (i.e., drawing  tinctions, such as one kind of S receiving a Sif 


WEAK’”’ 


1 the 
tions 
tion” 
med; 
1 the 

the 
leans 
ame. 
1 ac- 
ents. 
h in 

the 


is in 
1uch 
To of 
in- 
orth 
Ss 
of 
‘ong 


11m 


ited 
r of 
dis- 


—~- 


ACHIEVEMENT AND NURTURANCE IN TRIADS 


TABLE 4 


PERFORMANCE OF THREE MOTIVATIONAL GROUPS 
FoR ToTaL SERIES OF GAMES 
(N = 20 Triads) 


Index of Accomplishment —_ Control —— 
1. Mean points assigned* | 392.50 | 382.50 | 425.00 
2. Mean points gained 393.53 | 389.86 | 416.13 
3. Mean gain or loss over 1.03 7.36 | —8.87 
assigned points 
4, Mean rank in triad 
Assigned points 2.2 2.0 1.8 
Points gained 1.9 2.1 2.0 
Net over assigned .. 1.9 2.2 


* Points assigned as follows: Type I: 33.3 points per S; Type 
II: 50 points for each “2” drawn; Type III: 50 points for each “2” 


or “3’’ drawn; Type IV: 100 points for each “4” drawn. 


*Avs.N:x? = 5.58,8 = 1, P > .01. 


greater share of the points (indeed, all alli- 
ances in Type I were on a 50-50 basis, and 
disproportionate deals were too infrequent in 
other types to permit statistical analysis); 
nevertheless, Ss high in achievement appeared 
consistently to benefit during the course of 
play. In consequence, several efforts were made 
to assess the comparative success of the three 
motive categories over the entire series of 12 
games. Table 4 gives some of these indexes. 
The first three rows depict the situation in 
terms of points earned. Because of discrep- 
ancies in actual potential (power), a gross ad- 
justment was made by ascertaining what the 
outcome would be if each S played strictly ac- 
cording to initial strength (as explained in the 
note to the table). The second row shows the 
actual total accumulated, were outcome in the 
total series of games to be summed. The 
third row gives the average gain or loss when 
points won are subtracted from points as- 
signed. Although none of the differences is 
significant, nevertheless, it can be seen that 
despite the lower potential of Ss high in 
achievement, compared to those high in 
nurturance, the former end up quite well. 
Finally, the fourth comparison concerns 
intratriad relationships. It is clear that Ss 
high in achievement gain enough points some- 
how to move, on the average, from the lowest 
potential rank to the highest actual rank at the 
end of the series. It is striking that the differ- 
ences in average rank are not statistically 
significant, but that when Ss are compared in 
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terms of rank in net gain over assigned points, 
those high in achievement are significantly 
different from their rivals who are high in 
nurturance (x? = 5.58, P < .01). In fact, Ss 
high in achievement ranked first, in these 
terms, in 10 of the 20 triads, compared to 3 
for Ss high in nurturance. The strategy 
whereby this was accomplished will occupy 
us in the next section. 
DISCUSSION 

Two major points emerge from the results of 
this experiment: first, that perceived difference 
in power is a very compelling determinant of 
behavior, outweighing motivational differences 
and, second, that even despite this factor, 
persons high in achievement manage to im- 
prove their positions relative to the other 
participants. 

The first point is fully in line with other 
experiments, since we have yet to discover a 
condition that overrides the outcome predict- 
able on the basis of relative strength. A glance 
at Table 4 reveals that despite the deeper 
considerations to be discussed, Ss high in 
nurturance actually acquired the most points, 
as reflected in their having fortuitously oc- 
cupied the most favorable position in terms of 
initial strength. Table 1 and 2, also, show how 
the weak pairings vastly exceed those involving 
a strong member (Table 1) and how participa- 
tion in coalitions by individual Ss corresponds 
to their initial power (Table 2). 

It is remarkable, nevertheless, that motive 
conditions are related to style of play within 
this general principle, for Ss high in achieve- 
ment are able to arrive at a much better final 
outcome than their initial potential would 
indicate. They tend to be initiators of offers to 
ally when they are weak, whereas their fellow 
players tend to be recipients of offers (Table 3). 
In Type I, on the other hand, Ss high in 
achievement are significantly less often in- 
volved in alliances. As a consequence, motive 
conditions seem to operate differently when 
participants are equal in strength, compared to 
situations where power differences exist. If, 
however, we were to consider that the course of 
play itself may bring about some awareness of 
the other players’ characteristics—and we 
would suppose that this would occur early- 
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then the ostensible all-equal situation would 
become modified to the degree that Ss high 
in achievement acquire an initial advantage, 
so that, as shown by Hoffman, Festinger, and 
Lawrence (1954), the other two would tend to 
ally against him. (An additional point is the 
fact that scarcely any bargaining occurred in 
Type I; it should also be recalled that differ- 
ences in Type I represent an excess over 
triple alliances, 47 % of the total.) In this sense, 
Type I may not be an exception to the strategy 
of playing by initial strength. 

Why were Ss high in achievement relatively 
successful in capitalizing upon their assets? 
This problem requires that we focus attention 
upon the complex of factors within the triad 
itself. By no index does the player high in 
achievement actually win the competition 
(except when he would be predicted to do so 
on the basis of his actual potential; the S with 
the highest potential won in 16 of the 20 
groups). Rather, the person high in achieve- 
ment succeeds in improving his position rela- 
tive to the other two. No one of the devices by 
which he does so attains significance by itself, 
but all work in the same direction. He tends to 
seek alliances more often when he is strong as 
well as weak (too infrequent, however, for 
statistical analysis) and thus gains more 
alliances of this type, refuses more often to 
form alliances when he is all-powerful, is 
willing to accept a smaller proportion of the 
prize in return for an alliance, and so on. We 
have been led to the interpretation that the 
individual high in achievement seeks to manip- 
ulate the internal relationships of the triad to 
But no 
single tactic emerges, except the tendency to be 


maximize his chances of winning. 


an initiator; rather, strategy appears to be a 
function of all the conditions present in a 
particular group. For example, in Type I, he 
may be content with a 50-50 deal if he can get 
it; failing this, he may propose a triple alli- 
ance, thus gaining at least a third of the points. 
On the other hand, if he perceives a tendency 
for the other two to ally against him, he may 
quickly accept a tentative triple alliance, 
proposed by another member, without seeking 
a better deal. Similarly, he may either seek a 
comparatively small share of the prize when he 
draws the strong position or he may be satisfied 


to see what the other two propose; if they do 
not form an alliance he gains all the points (a 
rare occurrence, but favoring the player high 
in achievement). If this interpretation is 
sound, then the primary consideration for the 
person strongly motivated to achieve has little 
to do with the actual score, but much more 
with his standing relative to his competitors 
seeking to improve his position in whatever 
way is feasible under the circumstances. In 
this fashion, he apparently succeeds in weaving 
all the possibilities into a relatively more 
successful style of play than is the case for his 
opponents. Is this not always typical of the 
successful player? What baseball player is 
really concerned about the size of the final 
score? He knows that a score of 2 to 1 counts 
just as much as a score of 19 to 0. 
SUMMARY AND CONCLUSIONS 
An experiment was conducted to determine 
the effect of motivational differences on play in 
a competitive three-person game where initial 
strength of participants varies. In each of 20 
groups, one member was high in achievement 
but low in nurturance, one high in nurturance 
but low in achievement, and one intermediate 
in both variables. Each group played 12 games, 
three each of four types of power pattern. 
Performance was analyzed in terms of coali- 
tions, participation in coalitions, initiation and 
reception of offers to ally, and final outcome. 
In general, the course of play corresponds 
much more closely to the pattern of initial 
strength than to motivational differences. 
Within this condition however, Ss high in 
achievement tend to initiate offers, whereas 
those high in nurturance tend to receive them. 
Assessment of relative success at the end of 
play shows that Ss high in achievement im- 
prove their rank within triads to a significant 
degree. It is suggested that the person high in 
achievement employs a strategy, adapted to 
immediately relevant conditions, that serves 
to capitalize upon his assets. 
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HE primary focus of this study was on 
the differential reactions of the leader, 


the second ranking member, and the 
lowest status member of an informal group to 
pressures from other members of the group. 


The reciprocal side of this question, the differ- 


influence of members in these three 


on the evaluations of the other 


ential 
Status position 
members, was also investigated together with 
the relative weight of each in determining the 
group norm. 

A member’s striving to gain and maintain 


status in one of h 


is more important reference 
groups should be among the key determinants 
of his conformity to conflicting group judg- 
ments. The higher his motivation to attain 
these ends, and hence the greater his depend- 
ency on the group for motive satisfaction, the 
greater should be his susceptibility to group 
influence. This possibility assumes the status 
system of the group to be sufficiently open 
that a member perceives as real the possibility 
of either moving up the status hierarchy or of 
being displaced from his present station by 
other aspirants to his position. Viewed this 
way, the central question of this study becomes 
that of whether or the 
status occupants are differentially motivated 


not three selected 
to improve and maintain their positions in 
their reference These three members 
were selected as the main figures of the study 


group 


because there exist reasons for 


attributing greatest striving and conformity 


plausible 


to each of them 

The hypothesis that motivation is a simple 
function of the magnitude of distance from the 
goal would favor the possibility of greatest 


striving and conformity on the part of the 


This study was carried out under support of the 
Group Psychology Branch, Office of Naval Research, 
Contract Nonr 2149(02) with Vanderbilt University 
Thanks are extended to ONR for its support; to John 
Whaley for his aid in collecting the data; and to the 
officials and boys at Tennessee State Vocational Train 
ing School, Jordania, for their interest and participa 


study 


tion in the 
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lowest status man, in line with results of 
Fenchel, Monderer, and Hartley (1951) in their 
test of Benoit-Smullyan’s (1944) status equili- 
bration hypothesis. In their study, the refer- 
ence groups in which subjects displayed the 
greatest status striving were those in which 
they perceived their status as lowest. And yet, 
the very remoteness of the lowest status man 
from the top position might result in his being 
less motivated to move up the status ladder 
and consequently less sensitive to group pres- 
sures, especially if his position were a secure 
one. If, on the other hand, his position 
tenuous one, and if, as Thrasher (1927, p. 
long ago averred, “*. . . any status in the group 
is better than none....,” then the lowest 
status man might manifest greatest conformity 
since any further status loss would jeopardize 


is a 
332) 


his membership in the group. 

The second ranking person could prove to 
be the member on whom the goal of the top 
position exercises greatest motivational pull. 
Studies relating to gradient, level of 
aspiration, and class mobility seem consistent 
with this possibility, as is William James’ 


goal 


observation on . the paradox of a man 
shamed to death because he is only the second 
pugilist or the second oarsman in the world. 
That he is able to beat the whole population 
of the globe minus one is nothing; he has 
‘pitted’ himself to beat that one; and as long 
as he doesn’t do that nothing else counts.” 
(Knight, 1950, pp. 103-104). 

The findings of Dittes and Kelley (1956) 
suggest the greatest conformity for the second 
status man, if the degree of one’s perceived 
acceptance in a group assumed t 
relate highly to his status in the group. These 
authors found the highest incidence of con- 
formity to group evaluations to occur among 
those individuals who had been led to believe 
that their degree of acceptance by the other 


can be 


members was average, which in this case was 
next to the highest of the four degrees of 
induced acceptance. 
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The leader’s behavior should be less affected 
by striving for the top position than that of 
either of the other two status members by 
virtue of his having attained that goal. To the 
extent that conformity relates to striving for 
the goal, his need for conforming behavior 
should be at a minimum. The attainment of 
the top position, however, exposes the leader 
to pressures from a new source, that of main- 
taining his leadership status. If his position is 
secure, the leader can perhaps afford to deviate 
further from the behavior of the other mem- 
bers (Kelley, 1951); but if insecure in his posi- 
tion, the leader might be the most conforming 
to pressures from the group because of the 
greater psychological distance he could fall if 
displaced from the top of the hierarchy. 

The studies of Sundby (1959) and Harvey 
and Rutherford (1958) also bear on this prob- 
lem. Higher status members were found to be 
more yielding than those of lower status to 
majority opinion when the issue involved was 
relevant to the group (Sundby, 1959). But 
higher status persons were less influenced by 
deviant evaluations of other high status mem- 
bers on an issue of minor significance to the 
group (Harvey & Rutherford, 1958). How- 
ever, in neither of these studies were leaders’ 
responses separated in the analyses from those 
of other higher ranking members. 

The differential influence of the leader, the 
second ranking member and the lowest status 
member in both the formation and the change 
of group norms was also investigated. The 
results of studies concerned with the im- 
portance of status to source effectiveness, in 
the informal group (Harvey & Rutherford, 
1958) and in larger social units (Hovland, 
Janis, & Kelley, 1953), led to the simple 
assumption that the relative influence exerted 
in both these instances would follow the status 
positions directly. The design of the study was 
not ideally suited, however, for answering the 
latter questions. 


METHOD 


Subjects 


Twenty-seven informal cliques (122 Ss in all) were 
selected sociometrically from a state training school for 
delinquent boys. Nine groups, each with an average of 
4-5 members, served in each of three conditions of 
source influence. Each group served only in one source 
condition. 


Cliques in the training school were selected for study 
because of the assumption that status within them was 
very important to the members and that the resulting 
pressures against the deviant would be pronounced. 
The cliques were ascertained by a four-item sociometric 
questionnaire which asked for preferences from among 
all the boys in the school as teammates and as team 
captains for a baseball team and for the team in which 
they preferred to perform the experimental task. Only 
the responses on the two items having to do with pre- 
ferred captains or leaders were considered in determin- 
ing status within the group. 


Task and Apparatus 


The experimental task was the judgment of the dis- 
tance between two simultaneous flashes of light in a 
completely dark room. Two sets of such stimuli were 
used for each group. The leader or the second or the 
lowest ranking member (source) was exposed at appro- 
priate stages to one set of stimuli while the remaining 
members of the group (members) were being presented 
with the other pair of light flashes. In all conditions, 
the distance between the stimulus lights was 12 in. 
for the members, while for the source the distance be- 
tween the lights was either 12 or 48 in., depending on 
the condition. Unknown to the Ss, 2 partition sepa- 
rated the two sets of stimulus lights so that the source 
and members could be exposed to different stimuli 
while assuming they were seeing the identical stimulus. 
The use of the partition avoided the need for col- 
laborators, thus leaving Ss free to interact naturally 
and to impose the pressures normally applied in their 
particular group. Special caution was exercised by E 
throughout the experiment to prevent Ss’ bumping 
into the partition and becoming suspicious of the pro- 
cedure of the study. 

Activation of the lights was controlled by circuits 
in two Hunter Decade Interval Timers. The two lights, 
on the same circuit, flashed on and off at the same time. 
Exposure time was .4 sec. and the time between trials 
was either 15 or 30 secs., 15 secs. for private and 30 
secs. for public judgments. 


Procedure 


The sociometric questionnaire was administered in 
the boys’ own dormitory approximately two weeks 
before the selected groups participated in the experi- 
ment. The questionnaire was introduced with the 
explanation that its purpose was to pick groups of 
friends who would like to participate in a forthcoming 
experiment. The experiment was described as a study 
being sponsored by the Navy and concerned with the 
visual accuracy of individuals working in small groups 
in complete darkness. Cash prizes were promised for the 
three groups that made the most accurate judgments 
on the experimental task. The cash awards promised 
for the first, second, and third most accurate groups 
were six, four, and two dollars, respectively, for each 
person in the group. It was emphasized that the scores 
would be based upon judgments of the entire group so 
that no single individual could win a prize without his 
whole group having won. The offer of cash awards and 
emphasis upon the group as the winner was intended to 
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increase the likelihood that the boys would take the 
study seriously, make careful selections in the socio- 
metric choices, and hence be more prone to apply group 
pressures against persons who made deviant judgments 
in the subsequent experimental situation. 

The conditions according to which the cash prizes 
would be awarded were re-emphasized to each group 
as it came to take part in the experiment. Prior to 
going into the experimental room, each person within 
a group was assigned a number that represented the 
order in which judgments of the light distance were to 
be called out. The source, one of the three selected 
status occupants, was always assigned the highest 
number so his judgment would be made last. Ss were 
then led into the experimental room, a large room in one 
of the dormitories, and seated in the reverse order 
The source, always seated first, was placed on one side 
of the partition and the members were seated on the 
other side of the partition so that neither could see the 
stimulus lights of the other 

Ss were then instructed that their task was simply 
to judge the distance between the two flashes of light. 
The source and members, after being advised correctly 
of the 12-ft. distance between them and the lights, were 
then presented the 12-in. stimulus lights and told the 
actual distance between them in order to reduce further 
the intersubject variability in scales of judgment. To 
prevent the sample presentation from stereotyping 
the Ss’ responses, E pointed out that the pairs of lights 
that were to follow might all be the same distance 
apart, or no two pairs might be the same. 

The procedure from here on was divided into three 
phases 

1. This phase was concerned with the differential 
contribution of the highest, middle, and lowest status 
members to the formation of a group norm. The aim 
was to have each person first reach an individual 
standardization in his judgments of the stimulus and 
then have a group norm formed in which these personal 
norms would be differentially reflected. To form the 
personal norm all Ss were presented the same stimulus, 
the light flashes of 12-in. distance, for 20 trials with a 
time interval of 15 secs. between trials. In this stage 
Ss recorded their own judgments on small pads without 
knowledge of any other member’s judgment. The group 
norm was formed by presenting the same stimulus for 
10 more trials with only one judgment acceptable to all 
members allowed for each trial and with an interval of 
30 secs. between trials for discussion and joint decision. 
A score of zero, represented to the Ss as harmful to 
their chances of winning a prize, was recorded if a 
group decision was not reached within the 30-sec. 
period. Group pressures consequently were applied to 
the point that a zero score rarely occurred, and when it 
did it was always only for the first trial. 

2. This was the phase critical to the main problem. 
It was concerned with the extent to which the occu- 
pants of the selected status positions would deviate 
from the judgments of the other members; and if there 
was deviation, with the differential influence of the 
three sources on shift in judgments by the other mem- 
bers. This objective was approached in two steps. To 
provide a contro] measure for both source and members 
for the subsequent influence situation, all Ss once 


again made 20 private judgments of the presented 
stimuli, which they recorded without discussion. In 
this block of trials, however, the source and members 
were presented different stimuli. The source was ex- 
posed to the pair of lights 48 in. apart from one side of 
the partition, while from the other side the members 
continued to be presented with the 12-in. stimulus 
lights. Following the 20 private judgments, the same 
stimuli were judged for 20 more trials under influence 
conditions in which, instead of being forced to reach a 
common group norm, each S gave his own judgment 
aloud after each trial. An interval of 30 secs. between 
trials allowed enough time for Ss to give their judg- 
ments of what they assumed erroneously to be a com- 
mon stimulus and to register their reaction of approval 
or disapproval of the evaluations of the others. 

3. This last phase was concerned with the continua- 
tion of the effects of social influence in a situation free 
of external pressures, or in current terminology, the 
correspondence between public and private conformity. 
For this purpose, the source and other members ren- 
dered 10 private judgments of the same stimuli that 
they had judged publicly in the preceding phase. 

One E remained in the experimental room with the 
Ss during the experiment, while a second E, seated in 
an adjoining room, controlled the presentation of the 
stimuli. 


RESULTS 


Differential Influence of Status in Formation 
of Group Norm 


This result was determined by subtracting 
the personal norm of the leader and of the 
second and lowest status members in each 
group, represented by the mean of his first 
20 private judgments of the i2-in. stimulus, 
from the group norm, represented by the 
mean of the 20 unanimous judgments of the 
same stimulus by the entire group. The as- 
sumption was that the smaller the difference 
the greater the weight of the respective source 
judgments in forming the group norm. Results 
in Table 1 show the difference to have been 


TABLE 1 
DIFFERENCE BETWEEN THE PERSONAL NORM OF 
LEADER, SECOND RANKING, AND Lowest STATUS 
MEMBER AND SUBSEQUENT GROUP 
Norm FOR THE 12-INn. StrmvLus 


Group Comparison of Differences 
Norm- 
a 
Status Source’s Per Sa ” 
sonal Norm Diff t P 


Leader 3.67 

Second 4.71 

Lowest 5.24 
* N = 27 for each of the three sources, a source of each status 
from each of the total 27 groups which were combined in this com- 
parison 





Leader-Second 1.04 1.28 >.20 
Leader-Lowest 1.57 1.39 >.10 
Second-Lowest 45 .30 >. 
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TABLE 2 
Sources’ MEAN JUDGMENTs OF THE 48-IN. STIMULUS 
BEFORE AND DURING MEMBERS’ INFLUENCE 








| Before | During Comparison of Differences 
Geosed™ i Tetie> | Bate | TOR, | cece 


ence ence Diff. P P 
Leader 47.70 32.08 15.62) Leader- |—23.53) 3.21\)<.01 
Second 
Second | 56.67 | 17.52 | 39.15] Leader- | —4.02| .58 >.50 
| Lowest 
Lowest | 46.87 27.20 19.68) Lowest- |—19.47\22.94/<.001 
| Second | 





® N = 9 for each of the three sources, one source from each of 
the nine groups in each of the three experimental conditions. 


TABLE 3 
Sources’ MEAN JUDGMENTs OF THE 48-INCH STIMULUS 
DURING AND AFTER MEMBERS’ INFLUENCE 


During| After 
Source* | Influ- | Influ- | Diff. _—___ a 


ence ence Diff t P 

Leader 32.08 40.53 8.45 | Leader- 1.11|.14 | >.50 
Second 

Second 17.52 24.86 7.34 | Leader- —1.23).16 | >.50 
Lowest 

Lowest 27.20 36. 88 9.68 | Lowest- 2.34|.37 | >.50 
Second 

® N = 9 for each of the three sources, one source from each of 


the nine groups in each of the three experimental conditions. 


smaller, but not significantly so, the higher 
the status of the source.? Thus, while the data 
suggest that contribution to norm formation 
may be a positive function of status, the lack 
of statistical significance does not permit the 
difference in subsequent conformity to group 
pressures to be attributed to the differential 
weight of different status members in the 
initial formation of the norm, as some have 
supposed. 


Differential Conformity of the Three Sources 


Conformity of sources’ to members’ judg- 
ments was measured by the difference between 
the sources’ mean judgments of the 48-in. 
stimulus before and during the phase that 
public judgments of the 12-in. lights were 
being made by the members. Results in 
Table 2 show the second ranking source to 
have been influenced significantly more by 
members’ judgments than was either the 

? All ¢ tests reported in this study are based on the 


formula for correlated samples (Wert, Neidt, & Ah- 
mann, 1954, p. 141). 


leader or lowest status member. The leader 
was least influenced, but not significantly less 
than the lowest status source. 


Conformity of Sources in the Private Situation 
Following Members’ Influence 


It can be inferred from Table 3 that the 
influence exerted by the members on sources’ 
evaluations continued even when the external 
pressures were removed. The three sources 
tended to shift their judgments of the 48-in. 
stimulus toward the veridical when judging 
it in private, but none did so significantly nor 
to an extent significantly different from the 
others. 


Differential Effects of Sources on Members’ 
Judgments 


The fact that the three sources differed in 
the extent to which their judgments deviated 
from the members’ necessitated an analysis 
of member change in terms of proportions 
rather than absolute magnitudes: the ratio 
of member change to the amount of change 
advocated by the source. The change in each 
member’s judgment of the 12-in. stimulus 
from the second private to the immediately 
subsequent influence condition was divided 
by the difference between the mean of that 
member’s judgment of the 12-in. stimulus in 
the second private condition and the mean of 
the source’s judgment of the 48-in. stimulus 
during influence. 

In Table 4 is presented a comparison of the 
relative effectiveness of the somewhat dis- 
crepant evaluations of the three sources in 
changing members’ judgments. The higher 
the status of the source the greater his propor- 
tionate influence on members’ judgments, 
although the only significant difference in 
source effectiveness was between the leader 
and lowest ranking member (¢ = 2.01; P < 
05). 


Latent Source Effects Following Influence 


In Table 5 are presented data concerning 
changes in members’ judgments of the 12-in. 
stimulus in the third private condition, the 
condition that followed the members’ experi- 
ence of the more discrepant evaluations by 
the source. The leader’s influence on members’ 
judgments increased when the members were 
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TABLE 4 
Errects OF DiIscREPANT SouRCE EVALUATIONS ON 
CHANGING MEMBERS’ JUDGMENTS 
OF THE 12-IN. STIMULUS 


Member 
Source Mem- Mean Source Mean Mem Change 
Condition bers N Discrepancy ber Change Source 
Deviation 
Leader 31 10.20 1.53 15 
Second 34 4.65 35 08 
Lowest 30 6.63 —3.50 — .53 
TABLE 5 


LATENT Errects OF SouRCES ON MEMBERS’ PRIVATE 
Jupcments FoLtowinc Pusiic INFLUENCE 


Status of Mem Mean Mean 
bers |- “ t P 
Source Change Diff 
N 
Leader 1 1.27 Leader-Second 1.51 2.01 05 
Second 44 24 Leader-Lowest 1.72 | 2.12 0s 
Lowest #0 45 Lowest-Second —.21 38 ) 


removed from the restraints of the public 
condition, whereas the influence of the other 
two sources was less in the private than in the 
preceding public condition. The second-order 
difference between of differential 
leader influence on members’ judgments under 
conditions of private vs. public judgment, on 
the one hand, and corresponding measures of 
differential influence by the second and lowest 
ranking sources, on the other, was statisti- 
cally significant, but the difference between 
the latent effects of the latter sources was not 
significant. 


measures 


DISCUSSION 


The most important of the foregoing results 
is probably the demonstration of greatest 
conformity to group pressures on the part of 
the second ranking persons, those only one 
step removed from the leadership position. 
This finding is compatible with goal gradient 
theory and the concept of relative depriva- 
tion, both of which underlie certain phenomena 
of level of aspiration, including greater status 
strivings and conformity of members of the 
middle class. 

The failure of the study to show any clear- 
cut differences in the influence of the highest, 
second, and lowest status members in the 
formation and change of group judgments 
could well be due to the small size of the sample 


and to limitations of the experimental design 
inherent in the focus of the study. An ade- 
quate test of the effects of different sources 
would require their judgments to be made at 
a controlled and constant distance from the 
judgments of the other members, a require- 
ment not met in this study because of the 
differential effects of the group on the sources. 

The tendency for the three sources to con- 
tinue to give decreased judgments after re- 
moval of external group pressures suggests 
that the changes in sources’ judgment were 
due to something other than just public yield- 
ing to group pressure, a point similarly illus- 
trated in Sherif’s (1935) early experiment. 
Such change might be unwitting, or the inter- 
esting possibility exists that, even though a 
person may have knowingly modified his 
judgments to agree with another’s, pride may 
not allow him to switch back immediately to 
his initial evaluation even when he is free to 
do so. 

An unpredicted but not surprising finding 
was the tendency for the effects of leaders on 
members’ judgments to increase when the 
latter were rendered subsequently in private 
conditions in which the restraints of public 
surveillance were lessened. 


SUMMARY 


This study was concerned primarily with 
the differential conformity of the leader, the 
second ranking member and the lowest ranking 
member to pressures in the informal group. 
The informal groups were 27 cliques socio- 
metrically selected from a training school for 
delinquent boys. 

Under the incentive of a financial reward for 
the group most accurate in the experimental 
task, members of each group judged the dis- 
tance between two simultaneous flashes of 
light in a dark room. While the other members 
were judging the distance between two flashes 
of light 12 inches apart, the leader (or second 
ranking or lowest status members depending 
on the experimental condition) estimated the 
distance between two flashes of light 48 inches 
apart, which unknown to the Ss were separated 
from the other lights by a partition. In this 
condition, under the assumption all were 
seeing the same lights, Ss made their judgments 
aloud and brought verbal pressures against 
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the person who rendered the deviant judg- 
ments. 
rhe main finding of the study was that the 
second status member, the one only one step 
from the top, was significantly more conform- 
ing to judgments of the other members than 
was either the leader or lowest status man. 
The leader was least conforming, but not 
reliably less than the lowest status person. 
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THE STABILITY OF INTERPERSONAL JUDGMENTS IN 
INDEPENDENT SITUATIONS! 


EDGAR F. BORGATTA 


Cornell University 


HILE there are many problems in- 

volved in the use of ratings and 

rankings, their pervasiveness in 
social and psychological research attests to 
their importance. Thus, rather than to shy 
away from dealing with ratings and rankings, 
the need is for more systematic exploration 
and examination of their limitations. The study 
reported here brings together into a single 
experimental design several aspects that have 
generally received attention separately. The 
generic form of such a multitrait-multimethod 
design is discussed by Campbell and Fiske 
(1959). 

The research variables are stratified in three 
ways: 

1. Multitrait. The research includes three 
dimensions of trait description. The scores 
representing these dimensions are based on 
prior work indicating that only a limited num- 
ber of dimensions will account for most of the 
variance of interpersonal judgments. 

2. Multimethod. It provides data on inter- 
personal judgments made in two independent 
situations. 

3. Multimethod. It examines the interrela- 
tionship of self-ratings, self-rankings, and peer 
rankings. 


The Dimensions of Interpersonal Judgments: 
Prior Work 


An important emphasis ia the use of socio- 
metric ratings has been on the content of the 
criterion on which group members are asked 
to choose (or rate or rank) each other. Experi- 
ence in use and direct research have led to a 


' This study stems from the program of research 
titled, “The Variables and Conditions of Small Group 
Interaction” at New York University. Jonathan 
Robbin, technical associate, was in charge of machine 
analysis, and the research assistants were John Stim- 
son, Robert Guerrin, Gerald Marwell, and Ardyth 
Stimson. This research was supported in whole or in 
part by the United States Air Force under Contract 
AF49(638)-195 monitored by the Air Force Office 
of Scientific Research of the Air Research and Develop- 
ment Command 
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distinction between work (sociogroup) and the 
friendship (psyche-group) choices as relatively 
independent; that is, recognition has been 
given to the fact that the person who is chosen 
as the leader for a task situation might not be 
the same as the one with whom social activities 
might best be realized. Jennings (1947, 1950), 
for example, gave explicit attention to this 
analysis. Bales and Slater (1955) and Slater 
(1955) emphasized a similar distinction in the 
study of interaction process, under the rubrics 
of task and social-emotional specialists. In 
responses gathered after discussion meetings, 
however, organizing concepts included group 
member rankings on (a) contributing best 
ideas and (b) guiding the discussion, and rat- 
ings on (c) personal liking for the group 
members. 

While there have been many other classifica- 
tions of roles and of types of behavior, the 
above are particularly worthy of note since 
they stem from large bodies of empirical 
research. However, while the sets of concepts 
arising from both sociometry and interaction 
process analysis reflect an accumulated body 
of experience, the delineation of dimensions of 
ratings and rankings appears as a by-product 
rather than as a consequence of a direct inquiry 
such as Carter’s (1954). On the basis of factor 
analyses of his data, as well as a number of 
independently reported studies, Carter sug- 
gested that three categories account for most 
of the variance of individuals as seen in small 
group interaction: (a) individual prominence 
and achievement, (d) aiding attainment by the 
group, and (c) sociability. The relationship of 
these categories to the theoretical develop- 
ments in sociometry and interaction process 
analysis is directly visible. Additional work 
undertaken to examine Carter’s proposed 
three-factor system substantiates it at least 
in part, but the suggestion arises that four and 
possibly more factors may be required in 
order to account for the major portion of the 
variance of interpersonal judgments, with the 
dimensions organized around the concepts of 


the 


een 


Ca- 


ce 
cal 
ts 
on 
dy 

of 
ict 
ry 
or 


of 


ill 


STABILITY OF INTERPERSONAL JUDGMENTS 189 


individual assertiveness, sociability, intelli- 
gence, and emotionality (Borgatta, Cottrell, 
& Mann, 1958). 

We may at least conclude that propositions 
about the stability of judgments must be made 
with attention to the dimensions of content 
involved. The importance of this requirement 
has been particularly emphasized by Cron- 
bach (1955) in his analysis of discrepancy 
measures. 


The Consistency of Peer Judgments: Prior 
Work 


In the utilization of peer judgments, the 
question of the reliability of scores is a promi- 
nent and complex one. For example, if a person 
is said to have a high score on judgments 
made by peers on a sociability criterion, the 
question of reliability is not merely whether 
scores taken again in the same way for the 
same group will be relatively stable. Implicitly, 
the expectation is that the score identifies an 
attribute of the individual with some degree 
of generality. Still, it is quite another question 
to ask about the stability of such scores when 
they are taken in different groups or situations 

Referring particularly to sociometric meas- 
ures, Mouton, Blake, and Fruchter (1955) 
report only two studies of the consistency of 
judgments under conditions of experimental 
variation. The first is the study by Bell and 
French (1950) that employed a single criterion 
(leadership), in which it was found that with a 
sample of 25 subjects (Ss), organized as five- 
person groups in six successive 35-min. dis- 
cussion sessions in which no two Ss participated 
together more than once, the ranking scores 
of individuals were for any one session corre- 
lated on the average .75 with the average of 
the other sessions. The second study reported 
by Mouton, Blake, and Fruchter was their 
own, with a sample of 33 Ss organized in three- 
person groups in two successive 15-min. dis- 
cussion sessions, but in this case using 12 items 
or criteria. They concluded from their study 
that judgments of behavior can be made 
reliably under conditions of short-term inter- 
action and indicated that the items most 
reliably judged in their study were leadership, 
contribution to group decision, and dominance 
position within the group. 

Both these studies emphasize the consist- 


ency with which peers characterize the individ- 
ual, particularly in terms of short-term con- 
tact, but they do not focus on the dimensions 
of content in judgments as such. In the second 
study just mentioned, the three items reported 
to be most reliably judged might be expected 
to correlate at a very high level and thus to 
represent only one dimension. Other studies 
that have touched on the consistency of judg- 
ments with varying composition of groups, 
such as that of Bass and Wuster (1953), have 
reported positive findings but also have not 
been concerned with examining the problem of 
dimensions of judgment. 


Self and Peer Judgments 

In examining consistency over independent 
situations, there is as yet no systematic report 
on the relationship of self judgments with 
those of peers. The data reported here permit 
the self and peer judgments to be examined in 
twe independent discussion groups, with 
parallel analyses for three dimensions of 
judgment. Although the accuracy with which 
a person perceives the judgments of others of 
course enters these data indirectly, the data 
pertain to the correlation of the self-judgment 
with peer consensus. This is a measure of 
correspondence rather than of accuracy as 
such. 

In addition, prior self-ratings representing 
an abstract frame of reference are compared 
with the judgments made in terms of the con- 
crete sets of other persons in the two inde- 
pendent discussion situations. Thus, this de- 
sign permits examination of the consistency of 
self-ratings made “alone” with self-rankings 
made in the context of specific others, as well 
as the comparison of self-ratings with peer 
rankings. 


DESIGN OF THE ANALYSIS 


The sample employed for this analysis consisted of 
99 male Ss, college sophomores and juniors. Data were 
collected in three stages with from one to three months 
between each stage. 

The first type of data collected was a self-rating 
check list of descriptive behavioral characteristics re- 
sponded to as part of a general orientation test period 
for Ss. The check list represents 40 trait descriptions 
that have been developed and used in extensive re- 
search on the rating and ranking of behavior in inter- 
action situations (Borgatta, Cottrell, & Mann, 1958). 
The check list form asked the S to circle a number 








190 Epcar F. BorGATTa 


from 0 to 9 associated with a continuum running from 
“Definitely does not describe me’’ to “Definitely does 
describe me.” 

The second source of data was from a postmeeting 
questionnaire, based on the same 40 descriptive be- 
havioral characteristics, that was completed after par- 
ticipation in a five-person discussion. Ss who were not 
acquainted participated in the interaction situation for 
80 min., with four discussion topics. These topics were 
selected from an extensive attitude questionnaire called 
the General Orientations Profile that was administered 
previously at the orientation test period. Topics were 
chosen to represent orthogonal attitudinal content in 
the subject population. Thus, the successive alirnments 
of agreements and disagreements on the topics oc- 
curred by chance within a session, and the opportunity 
was maximized for a person to find a topic in which he 
had some interest and on which he took some position. 
In this sense, also, the discussion situation was de- 
veloped to maximize the possibilities for exposure to 
discussion participation on a variety of issues in a 
relatively short period of time. The questionnaire for 
ranking the group members was administered after the 
discussion. This provided self-rankings of the Ss on 
the descriptive behavioral characteristics and also the 
peer rankings (minus self-rankings) on the same 
characteristics. 

The third source of data was similar to the second, 
but was carried out in a somewhat shorter discussion 
session (three new topics, 20 min. each, with the same 
orthogonal content as three of the previous topics), 
in three-person rather than five-person groups, and 
again in groups composed of persons who were not 
acquainted. Self and peer rankings on the characteris- 
tics were available from this source 

The three scores based on the ratings or rankings 
employed in this analysis are Assertiveness, Sociability, 
and Emotionality. Each score is composed of several 
highly intercorrelated variables found to be consistently 
associated with orthogonal clusters in factor analysis. 
The score in each case is the sum of the self-ratings, 
self-rankings, or peer rankings on the relevant items. 
The scores are composed of responses to the following 
items: Assertiveness (A)—Is very active, Initiative, 
Makes many suggestions, and Assertiveness; Socia- 
bility (S)—-Shows solidarity and friendliness, Makes 
others feel he understands them, and Likeability; 
Emotionality (E)—Is very tense, Gets upset very 
easily, Tends to be very nervous, Emotionality. The 
remaining 29 items tend to be distributed on these 
factors or are involved in less well-defined factors 


RESULTS 
Dependencies between the Contents 


The matrix of product-moment intercorrela- 
tions for all scores in this analysis is presented 
in Table 1. Since the research focus requires 
comparisons of individuals in independent 
situations, the correlations were computed 
over all 99 Ss. It would be possible, also, to 
make more limited examinations of the rela- 


tionships among measures by computing, say, 
for each five-person group the intercorrelation 
matrix of the self-ratings, the five-person self- 
rankings and peer rankings, and the parallel 
scores each member received in the independ- 
ent three-person groups. If this procedure 
were carried out it would be possible to de- 
scribe directly the distribution characteristics 
of the observed relationships over the five- 
person groups. However, the procedure used 
appears appropriate in this context since the 
use of rankings holds constant the sum (or 
mean) of all ranks within the group, and the 
variability of group means of self-rankings and 
peer rankings is restricted within this depend- 
ency. Such variability would tend to attenuate 
relationships between the classes for the 
pooled data. Consistency of frame of reference 
between groups as well as within groups is 
necessary for relationships to be manifest in 
the pooled data. 

A slight but significant relationship occurs 
among the three self-rating scores with As- 
sertiveness and Sociability positively corre- 
lated (r,,s, = .23) and both of these negatively 
correlated with Emotionality (74,2, = — .21; 
‘sex, = —.22). In the self-rankings, both for 
the five and the three-person groups, Assertive- 
ness and Sociability are positively correlated 
(ras, = 37; fas, = -10), although for the 
three-person group the correlation (.10) does 
not achieve the significance level. In the self- 
rankings, neither for the five- nor the three- 
person groups was the relationship between 
Assertiveness and Emotionality negative 
(rae, = -17; rage, = -05). On the other hand, 
the correlation of Sociability with Emotional- 
ity is negative and significant for both the 
five-person (rs,x, = —.20) and the three-person 
groups (fan, = —.34). 

For the peer ranking scores in the five- and 
three-person groups, Assertiveness and Soci- 
ability are again positively correlated (ra, = 


51; ras, = .32), and Sociability is negatively 
correlated with Emotionality (ran, = —.39; 
'yx, = —-41). The relationship between 


Assertiveness and Emotionality is indicated 
to be positive, although slight and significant 
only for the three-person groups (ra,,r, 
09; rae, = .22). 

Although the measures for the three dimen- 
sions of content in the ratings and rankings 
were drawn from previous studies that indi- 
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TABLE 1 
INTERCORRELATION MATRIX 
N = 99 
Self Ratings Five-Person Groups Three-Person Groups 
Self-Ranking Peer Ranking Self-Ranking Peer Ranking 
Al $1 F 
A S2 E2 As S3 Es Aa S4 E« As Se Es 

Ai — 23 -2] 43 31 06 | 31 25 -12 22 14 05 23 31 | —03 
Si 22 11 53 | —21 | 05 07 | —19 02 42 27 | —01 0s | —02 
E} —19 | —2] 29 | 10 | —08 22 | —02 | —25 26 | —06 ; —07 17 
Ae 37 17 55 33 13 42 03 07 39 10 11 
Se —20 | 05 20 27 11 51 -16 15 23 | —2/1 
Es 21 | —08 43 14 -—17 50 21 05 22 
A; 5] 09 18 —(07 O8 ; 14 05 
53 39 02 04 05 é. 44) —2. 
Es 05 | —17 15 05 | —32 

Me 10 05 61 22 28 
a 34 00 27 | —14 
Ey 16 | —08 15 
A 32 22 


Note.—Decimal points omitted. a, 


s, and £ are the scores on dimensions of Assertiveness, Sociability, and Emotionality. Value 


of | r 20 would satisfy a symmetric test of significance at the .05 level with 95 d/- Significant values are italicized 


cated a high degree of independence among 
them, in this analysis the three contents tend 
to be related. The relationship probably 
depends in part on the fact that the groups 
had participated together only for a relatively 
short period of time and the opportunity to 
communicate had been somewhat restricted. 
In any set of peer judgments, as the group 
members get to know one another they should 
be better able to identify the characteristics 
of each person and to distinguish accurately 
between the characteristics in their rankings. 
When one knows an individual well enough, 
that is, one can fake into account his other 
qualities in ranking him on a specific quality. 
Consider the following question, for example: 
“How high is an individual on Sociability?” 
Taking into account individual differences in 
activity levels appears to be implicit in the 
question. 


Interrelationships of Self-Judgments Across 
Situations 
The existence of some dependency among 
the three dimensions of content does not pro- 
hibit examining relationships across situations 
and types of measures. Thus, to begin with, 


the self-ratings made in terms of abstract 
reference points are predictive of the ranks 
Ss assign to themselves in the five- and three- 
person groups. The self-rating score on Asser- 
tiveness is correlated with the parallel self- 
ranking score in both discussion groups 


(rang = -43; faa, = -22); the self-rating 
score on Sociability is correlated with the 
parallel self-ranking scores (fgs, = .53; 
rs,s, = -42); and the self-rating score on Emo- 
tionality is correlated with the parallel self- 
ranking scores (rg,n, = .29; ree, = .26). All 


these values are statistically significant and 
constitute the largest values, with only minor 
exception, in the vectors in the intersection 
matrices. The exception occurs for the correla- 
tion between the self-rating score on Soci- 
ability and the self-ranking score in the three- 
person group on Emotionality (sn, = —.27). 
The negative relationships between Sociability 
and Emotionality from self-rating scores to 
self-ranking scores are statistically significant 
'sen, = —.21; tan, = —-27; ree, —-21; 
ps, = —.25). The additional significant rela- 
tionship in these comparisons is a positive 
correlation of the self-rating score on Asser- 
tiveness with the self-ranking score on Soci- 
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ability for the five-person group (74,5, = .31). 

Stability of self-ranking scores may also be 
examined between the five- and the three- 
person groups. Ss participated in these groups 
with different persons, and the restriction of 
comparison obviously differs when an S 
utilizes four other persons versus two other 
persons as the frame of reference in identifying 
characteristics. It is seen, however, that the 
self-ranking scores in the five-person group 
correlate with the self-ranking scores in the 
three-person groups quite substantially, in 
spite of the obvious restriction on variance 
involved in the limited number of comparisons 
in the groups (r4,, 
50). 

The stability of self-ranking scores from 
situation to situation recalls the problem of 
contamination of measures that makes un- 
raveling data of an interpersonal type quite 
challenging. Do the relatively stable indicators 
mean that a person has a well-defined and 
stable perception of his role among others or 
does it indicate, rather, that his place among 
others is indeed relatively stable and that he is 
able to identify what it is with some degree of 
accuracy? This question can be answered in 
part by examining the self-judgments in 
terms of the peer judgments, to which we will 
turn after considering the interrelationship of 


). » 2. . 
42: 'sa, = 1 Po TEE, 


the peer rankings. 


Interrelationship of Peer Rankings Across 
Situations 


As we have already noted, the interrelation- 
ship among the dimensions of content used in 
the peer judgments borders on the substantial 
in both the five- and the three-person groups. 
However, when one examines the predict- 
ability of peer judgments from five- to three- 
person groups, one notes that the largest 
correlations in the intersection matrix occur in 
the main diagonal and each dimension best 
predicts itself. Thus, across the situations the 
correlations are for Assertiveness, ra,,, = .38; 
for Sociability, rg,s, = .44; and for Emotional- 
ity, True, 44. Peer ranking scores on Soci- 
ability are significantly negatively correlated 
with peer ranking scores on Emotionality 
SEs —.32), but the magni- 
tudes are below those of the main diagonal in 
the intersection matrix. An additional sig- 


(Fs.x, — 23; 7 


nificant relationship occurs in this intersection 
matrix between the peer rankings on Sociabil- 
ity for the five-person group and the peer 
rankings on Assertiveness for the three-person 
group, although the relationship is small 
(7g,,, = .23). 

We may now examine the question of the 
predictability of peer ranking scores on the 
basis of self-rating and self-ranking scores. 
The peer scores, it should be recalled, do not 
involve self-rankings. 


Self-Judgmenis as Predictors of Peer Judg- 
ments 


Self-rating scores on Assertiveness are sig- 
nificantly correlated with corresponding peer 
ranking scores in the five- and the three-person 
groups (rai, = -31; 7a,4, = -23). In both cases 
self-rating scores on Assertiveness are also 
significantly correlated with peer ranking 
scores on Sociability (ras, = .25; ra,s, = .31). 
Self-rating scores on Sociability, on the other 
hand, are not significantly associated with the 
peer ranking scores on Sociability (rg, = 
07; Ts,s, .08). For the five-person groups, 
self-rating scores on Emotionality are sig- 
nificantly correlated with peer ranking scores 
on Emotionality (rex, .22), and the rela- 
tionship is maintained in the three-person 
group (rg,r, 17) although it falls short of 
significance. 

One notes first that self-judgments employ- 
ing a specific group of individuals as a frame 
of reference lead to more substantial predic- 
tive correlations for the peer ranking scores in 
the groups than do self-ratings made in the 
abstract. Thus, for the five-person groups, self- 
ranking scores on Assertiveness are substan- 
tially correlated with peer ranking scores 
(rasa, = -55). Interestingly, these self-ranking 
scores in the five-person groups also are sig- 
nificantly correlated with peer ranking scores 
for the three-person groups, and again at a 
relatively substantial level (r4,4, = .39). 

The findings for self-rankings in the five- 
person groups in terms of Sociability and Emo- 
tionality parallel those of Assertiveness; that 
is, self-rankings on Sociability are significantly 
correlated with peer rankings in the five- and 
three-person groups (rss, = .20; rss, = .23), 
although for Sociability the magnitudes are 
smaller. Similarly, self-ranking scores on 
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Emotionality for the five-person groups are 
correlated with peer ranking scores on Emo- 
tionality (rex, = .43; ree, = .22). 

There are additional correlates as well: self- 
ranking scores on Assertiveness for the five- 
person groups are correlated with peer ranking 
scores on Sociability in the five-person groups 
(ra,s, = -33); the parallel relationship occurs 
for three-person groups (ra, = .22) but does 
not extend between self-ranking scores of the 
five-person groups and peer ranking scores for 
the three-person groups (r4,s, = .10), or vice 
versa (fas, = —.07). For the five-person 
groups, self-ranking scores on Emotionality 
are significantly correlated with peer ranking 
scores on Assertiveness (rp,,, .21), and also 
to peer ranking scores on Assertiveness in the 
three-person groups (fg, = .21). However, 
the parallel relationships for  self-ranking 
scores on Assertiveness for the three-person 
groups are not significant (fe,4, = -08; rea, = 
.16). Self-ranking scores on Sociability for the 
five-person groups are significantly negatively 
correlated with peer ranking scores on Emo- 
tionality for both the five- and the three-person 


groups (fsen, = —.27; fan, = —.21); the 
parallel relationships for the three-person 
groups are not significant (rg2, = —.17; 
Tsk, -_ — .14). 


Examining now the self-rankings in the 
three-person groups, self-ranking scores on 
Assertiveness are positively correlated at a 
substantial level with peer ranking scores for 
the three-person groups (ra, = .61) and, 
also, although at a much lower level, with the 
Sociability and the Emotionality peer ranking 
scores (r4,s, = .22; ran, = .28); self-ranking 
scores on Sociability are positively correlated 
with peer ranking scores on Sociability for the 
three-person groups (fg,s, .27); while the 
correlation between self and peer ranking 
scores on Emotionality is in the correct direc- 
tion, the magnitude is not sufficient to satisfy 
the criterion of statistical significance (rp,», 
15). 

Finally, self-rankings for the three-person 
groups do not appear to be efficient predictors 
of peer judgments in the five-person groups. 
It should be noted, however, in this intersec- 
tion matrix all values in the main diagonal are 
positive although none are of sufficient magni- 
tude to satisfy the significance criterion 


(rasa, = .18; 75,5, 04; ree, = .15). 


DISCUSSION AND SUMMARY 

Self-rating scores appear to be direct pre- 
dictors of the self-ranking scores that are 
made after five- and three-person discussion 
situations (median r = .35), independently of 
the dimensions of content. This finding could 
result simply from an individual’s self-image 
or bias that he carries over into his self-per- 
ception when ranking himself in regard to any 
set of specific others. That is, if he judges him- 
self to be relatively high in an abstract sense 
(self-rating), he may assign himself a rank 
that is high without attention to the qualities 
of others, whether he does this unconsciously 
or deliberately. This is not, it should be noted, 
the elevation problem that Cronbach (1955) 
discusses, i.e., the tendency to rate all persons 
high, low, or at some given level, since the 
mean rank for each set of judgments is by 
definition the same. 

The conception of an abstract self-image or 
bias in self-perception is not easily detached 
from the question of stability of qualities and 
accuracy of perception. If persons are stable 
in their qualities, the stability in independent 
situations would be dependent upon the 
equivalence of sets of persons in the different 
situations. Thus, the variation of the char- 
acteristics of sets of group members must be 
thought of as an important source of variance 
in the “unreliability” of judgments in inde- 
pendent situations. But beyond this considera- 
tion, in order for there to be reliability there 
must be accurate perception of the qualities 
of the group members in the independent 
situations. The prediction diagonal of peer 
judgments from five- to three-person groups 
(AsAg, SsSs, EsEs) tends to be positive and non- 
trivial (median r = .44), thus establishing the 
relative stability of ranks in independent 
groups, irrespective of the dimensions of con- 
tent. Dependently, this also means that group 
members must view themselves consistently 
and with a common frame of reference. Since 
the prediction diagonals from self-rating and 
self-ranking scores to peer ranking scores 
(AjA3, $1S3, E,E3; A1As5, S1Ss, ExE5; “zA3, SeS3, EoE3; 
AoAs, S2Ss, E2E5; AgAs, S483, EgEs; and AgAg, S455, 
E4E5) tend to be positive and nontrivial (median 
ry = .22), group members must also have accu- 
rate perception of their qualities according to 
the group consensus. 

A point that needs to be emphasized here is 
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that the respondents were not asked to predict 
how others would rate or rank them. Thus, 
except for logical dependency indicated, the 
ability of the person to predict judgments 
others make is not involved. This was not the 
task assigned 

As for the magnitudes of predictions, it 
appears that correlations of self-rating with 
self and peer rankings are larger for the five- 
person groups (AjA2, SiS2, E:E2; median r = .43, 
AyA3, S183, ExE3;; median r = .22) than the three- 
person groups (AjAq, SiSs, E:xE4; median r 
26; AiAs, $1Ss, E1Es; medianr 17). This result 
accords with the view that the variation of 
sets of group members restricts the reliability 
of judgments, i.e., a more general base of 
judgment is available in five- than in three- 
person groups since sampling expectations dic- 
tate less variation in the means and variances 
of samples of larger size. In addition, of course, 
the concrete frame of reference for judgment 
is larger. 

Since dimensions of content were employed 
that tend to be defined orthogonally in factor 
analytic studies, the generality of findings 
could be examined over these dimensions. In 
general, findings tended to occur in parallel, 
supporting the notion that principles may be 
observed without regard for the content but 
also emphasizing the need for specific and 
inclusive dimensions in describing character- 
istics of persons. 
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STIMULI RELATED TO STRESS, NEUROTICISM, EXCITATION,”AND 
ANXIETY RESPONSE PATTERNS: 


ILLUSTRATING A NEW MULTIVARIATE EXPERIMENTAL DESIGN! 


RAYMOND B. CATTELL anp IVAN H. SCHEIER 


University of Illinois 


HIs research aims at three targets: 
1. To confirm and extend our knowl- 
edge of the various responses to what 
may broadly be called “threat,” and which 
include, according to present evidence, at least 
the three distinguishable states of anxiety, 
stress response, and neurotic response. 

2. To see if lawful relations can be found 
between the magnitude of response along these 
defined factor dimensions and the nature and 
magnitude of certain commonly occurring 
“stressful” or “threatening” situational stim- 
uli. 

3. To illustrate and gain experience of the 
new condilion-response design in multivariate 
experiment proposed by Cattell (1952) for 
combining the advantages of factor analytic 
and controlled experiment. The example here 
involves the measurement of 86 young men 
on 69 manifestations of anxiety, stress, and 
neurotic response on two occasions, differing 
by presence or absence of (a) imminence of an 
important examination, (b) a group “stress 
interview” by a questionnaire on emotional 
problems, and (c) the challenge of a punishing 
physical ordeal. 

A survey (Cattell & Scheier, in press-a) of 
current experiments in the area commonly re- 
ferred to as that of ‘‘anxiety” or “stress” 
response indicates certain persistent ob- 
scurities: (a) confusion as to whether stress, 
anxiety, etc., are to be defined by the nature 
of the stimuli or of the responses, (0) in either 
case, the use of arbitrary and conflicting 
choices of variables to measure or manipulate 
experimentally, and (c) undue dependence 
upon a single test or variable “‘to represent 
operationally” anxiety, stress, etc., when a 
priori it is likely that the variance of any 
such single variable will be composite, and 
the measure of anxiety or stress requires 

1 This article reports part of a study conducted under 


contract to the Research and Development Division, 
Office of the Surgeon General, Department of the Army. 
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representation in a single battery of several 
variables demonstrated to occur in the response 
pattern. 

The methodologically inadequate depend- 
ence on a univariate approach is illustrated by 
the work of Taylor (1951) and a succession of 
similar studies (Hilgard, Jones, & Kaplan, 1951; 
Ramond, 1953; Spence & Beecroft, 1954; Spence 
& Farber, 1953; Taylor & Spence, 1952), com- 
paring learning or conditioning for high and 
low anxiety groups, established as such on the 
basis of scores on one questionnaire said to be 
a face-valid measure of anxiety (Taylor, 1953). 
Almost all such studies employ only one or 
two tests alleged to represent anxiety, and 
then seek a relationship between this “anx- 
iety” and one or two other variables, such as 
conditioning or environmental stress, pre- 
sumed to have some connection with anxiety.” 
Unfortunately, the correlations among such 
confidently employed indicators of anxiety 
are often unknown, or when known, turn out 
to be low. 

By contrast, the multivariate approach first 
seeks to discover and define what general fac- 
tor or factors exist, as empirical constructs, 
among a wide range of semantically and theo- 
retically alleged psychological (and physio- 
logical) manifestations of anxiety. Regardless 
of how the discovered factor is named, a bat- 
tery of tests for such a factor is then known to 
measure a unitary response pattern. Researches 
to locate such unitary factors in the area of 
stress and anxiety were completed prior to the 
present research, and they have replicated 

2 The fact is that many (including, apparently, J. A. 
Taylor herself [1953]}) do assume that the Taylor Scale 
by design and definition is a measure of “anxiety.” A 
recent tendency to hedge on this position by disowning 
the claim to “anxiety” and simply calling it “drive 
strength” or one type of drive strength merely, repeats 
and accentuates the logical offense. Actually, there is 
evidence that drive strengths can be separately meas- 
ured in man (Cattell, 1957a), but the anxiety measure, 
though connected in a special way therewith, is con- 
ceptually and factorially distinguishable. 
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beyond doubt the pattern of (a) a general 
Anxiety factor, symbolized in the universal 
index of factors as U.I. 24 (Cattell, 1957a, 
1957b), (6) a stress response factor, symbolized 
as P.U.I. 4 (Cattell, 1955a, 1957a), and (c) 
a neurotic debility or “Low Neural Reserves” 
factor, U.I. 23(—) (Cattell, 1957a; Cattell & 
Scheier, in  press-b). Elsewhere (Cattell, 
1957a; Cattell & Scheier, 1958a, in press-a, in 
press-b) and in subsequent sections of this 
paper, we document our claims to the inter- 
pretation of these factor dimensions as con- 
stituting the firm, functionally unitary cores 
in the common concepts of Anxiety, Stress, 
and Neurotic Debility. 

Despite the satisfactorily definitive outcome 
of the researches confirming these factors, the 
main ten factor analytic experiments were only 
in terms of individual differences between 
people, at any one time, and of responses to 
fixed situations. That is to say, the simple 
structure patterns appeared in regular cross- 
sectional, R-technique analyses, defining éraits 
rather than states. Since then, however, two 
P-technique researches, factoring serial meas- 
ures in single individuals, have shown patterns 
so similar to those found in R technique that 
it is reasonable to call them by the same names 
and to expect that part, at least, of the indi- 
vidual differences existing at a given moment 
of cross-section (R technique) are due to dif- 
ferences in these temporary states. Thus, an 
anxious person is partly characterologically 
so and partly a person temporarily in an anx- 
ious state, the two patterns being so similar 
that they cannot be factorially separated—so 
far. 

However, in both R- and P-technique studies 
the situations productive of anxiety have been 
left to life happenings. In obtaining the true 
multivariate response patterns, which the uni- 
variate experimenter can never hope to see, 
we have hitherto sacrificed the connection 
with a stimulus in which he is interested. Such 
initial attention to the definition as a response 
patiern seems to us the proper strategic se- 
quence, and we now prepose to build upon 
these results by the niw condition-response 
multivariate design (Cattell, 1952), which 
links factor analytic and analysis of variance 
designs, and promises to connect these re- 
sponse patterns with the stimuli which gener- 
ate them. It also permits an independent 


contribution to the definition of the response 
pattern, for whereas P technique takes one 
person over many occasions, thus sampling 
well for occasions but poorly for people, the 
incremental R technique (embedded in the 
total condition-response design) samples only 
two occasions but performs the usual adequate 
sampling of people. If these approaches con- 
verge on a congruent definition of a single 
complex response pattern, we can be confident 
of having something transcending most sam- 
pling distortions. 

The general theory of condition-response 
design is given elsewhere (Cattell, 1952), but 
its nature will be illustrated in the following 
account of actual experimental procedures. 
Essentially, it measures each subject under 
two conditions (which would be high and low 
either on situational stimulus levels, as here, 
or internal conditions in, say, a psychophysio- 
logical experiment). At the same time, it 
measures all of a considerable variety of re- 
sponse variable magnitudes and then corre- 
lates changes in stimulus magnitudes with 
changes in response magnitudes, instead of 
relating significance of difference in one to 
difference in the other, as in analysis of vari- 
ance. Upon factoring this matrix, one obtains 
both (a) evidence (or, in our case, confirmation) 
for what unitary response patterns, e.g., 
anxiety, stress, etc., may exist in the given 
range of dependent variables, and (0) the sig- 
nificance and magnitude of the relations of 
these unitary response patterns to the inde- 
pendent variables constituted by the stimulus 
conditions. 

As pointed out above, evidence of the first 
can be obtained by R technique above, or P 
technique. Woodrow (1938) has factor ana- 
lyzed increment scores in the learning and 
mental ability domain, and several R and 
P studies have recently been carried out in 
personality data, but to our knowledge the 
present study is the first to perform incre- 
mental R technique in personality research in 
a design with manipulated, randomized, inde- 
pendent variables in what has hitherto been a 
theoretical proposition for a condition-re- 
sponse design. 


DESIGN AND PROCEDURE 


S’s were 86 male undergraduates: 52 unselected in 
relation to anxiety, 18 of high anxiety, and 16 of low 
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anxiety. The same 72 variables were scored for each of 
the 86 Ss on Occasion 1, and then again on Occasion 2, 
four weeks later. Sixty-nine of the measures entered 
in the matrix were changes in score from Occasion 1 to 
Occasion 2 on personality “response” variables; three 
scores were changes in the magnitude of stimuli, repre- 
senting “massive’’ environmental “stress’”’ conditions. 
The change score was always computed‘ as Occasion 2 
minus Occasion 1. 

Most of the response variables are well known, 
collected from previous attempts to measure anxiety, 
stress, neurotic debility, and several other previously 
established general personality factor dimensions. Most 
readers will wish only the indication of their nature 
given by their title and description in the text. Those 
requiring more detailed descriptions can obtain them 
from the authors or by reference to other researches, 
where the rationale for selection of personality factor 
markers is also explained (Cattell, 1955b; Cattell & 
Scheier, in press-c; Scheier & Cattell, 1958). The main 
aim in the choice of variables was to make possible the 
distinguishing of the threat-response factors from other 
known personality factors. 

The environmental conditions, i.e., the stimuli, 
were varied in a two-step, present-absent, on-off 
manner. The “on” phase for each is as follows. 

Anticipation of Treadmill Run.* Just before the 
testing session began, S was given a notice of an 
appointment, falling within two or three days, to 
take the “all-out treadmill run,” as part of his re- 
quirement for the testing program. This test, which 

Ss actually took, involved running on a treadmill to 

On the basis of previous personality profile infor- 
mation on athletes (Cattell et al., 1956), nine varsity 
athletes were included in the low anxiety group, while 
the other high and low anxiety subjects were pre- 
selected from the upper and lower 10% according to 
the preliminary criterion of scores on the anxiety factors 
(second order) in the 16 P.F. Questionnaire (Cattell, 
1956). These anxiety selections were not used as 
criteria, but only to give a good “spread” on anxiety 
responsiveness. 

*A good deal of discussion has been devoted by 
McNemar (1958) and others to the desirability of 
correcting change scores so that no correlation remains 
between absolute scores and change scores. Without 
space to discuss the complex issues involved we shall 
simply say that (a) this “ideal” seems to us very 
questionable, though McNemar’s formula with the 
goal of eliminating that part of the correlation which 
is due to shared error of measurement would be ac- 
ceptable, and (b) dealing as we are here with changes 
which are considerable, relative to those occurring in 
abilities, we have felt it desirable, in a first report, to 
give results as they emerge with a minimum manipu- 
lation of score data, reporting the more refined treat- 
ment of error in a later analysis. 

‘Throughout the research, extreme precautions 
were taken to prevent Ss from discussing their research 
experiences with one another. It is believed these 
precautions were successful, specifically in preventing 
“anticipation of treadmill stress’ at occasion 1, for 
those S’s to whom it was not applied until occasion 2. 


the limit of endurance (some had to be helped off the 

instrument) before an audience of physical education 

specialists, including at least one woman. S was 
advised of these details in the notice. 

Imminence of Academic Examinations. Each S 
was tested once just before semester exams (or in 
the midst of them)—the “on” stimulus—and once 
approximately four weeks before examinations, in 
what we assumed was a relatively carefree period. 

Questionnaire “Group Stress Interview” Probe. 
A list was made of the common themes for generating 
upset in interview stress (personal defects, economic 
insecurity, war dangers, threat of illness, etc.). These 
were given as stimuli in questions which all Ss 
answered instead of by interview questions. This 
loses the freedom to probe where it most hurts for 
each individual, but it compensates by more stand- 
ard conditions. Immediately before the test session 
assigned for “stress-on,’’ S was provoked by re- 
sponding to 40 items reminding him of common 
sources of worry: e.g., finances, military service, 
health, academic status, etc. Typical questions were: 
“Have you ever noticed or suspected any symptoms 
of insanity in yourself? Specify briefly.” “Are you 
in debt now?” “Are you doing as well in school as 
your parents (or you yourself) hoped you would? If 
not, why not?” 

It is an important part of the condition-response 
design (Cattell, 1952) that the various stimuli used be 
randomized in relation to one another, as in an analysis 
of variance with several effects. Otherwise, any correla- 
tion among the stimuli would have only local meaning— 
an artifact of the experimenter’s design—and would 
complicate the interpretation of the correlation with 
the response patterns. Accordingly, a design was set 
up for combining the high and low (on and off) inci- 
dences of the three stimuli with equal frequency. 
Actually, the essential aim is to have the changes from 
first to second occasion on each stimulus uncorrelated 
with changes in the other two stimuli, and to have a 
step down (“‘on” to “off’’) between occasion one and 
two just as frequent as a step up, i.e., to avoid con- 
founding with a particular sequence effect. This 
requires eight combinations, as shown in Table 1. 

A given stimulus was in the “on” condition for 
approximately half the Ss at Occasion 1 and “on” for 
the other half at Occasion 2. All possible on-off com- 
binations of the three conditions occurred at each time; 
hence, all possible combinations of increments and 
decrements occurred between Occasion (or “time’’) 1 
and 2. An increment in a given condition (“off to 
“on”) between Time 1 and 2 was scored in the subse- 
quent correlation as 2, a decrement as 1. Thus for all 
Ss in Group (Cell) 1, the stimulus scores entered were 
T = 2, E = 1, and P = 1; whereas for group (Cell) 2, 
the corresponding scores entered were 2, 2, and 1. 
Positive correlation (phi coefficient) between a con- 
dition and a response increment variable thus signifies 
that increase in the amount of this “stress” condition 
is associated with increase in the response variable, 
from the first to the second occasion. (No “on-on”’ and 
“off-off’’ sequences were studied in this first attack, 
though they could be used, giving a middle value in a 
three score range). 
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TABLE 1 


INCIDENCES OF THE THREE “StrREsS’ STIMULI AT 
Tre 1 AND Trwe 2 


Subject Time | Time 2 (4 Weeks later) 
Group 
1 r—, E+, P+ r+, E-, P- 
2 r—, E-—, P+ T+, E+, P- 
3 r—, E-, P- T+, E+, P+ 
4 r+,E-,P r—, E+, P+ 
5 r+, E+, P- T-—, E-, P+ 
6 r+, E+, P+ r—, E-, P- 
7 r+, E—-, P+ r—, E+, P— 
8 I 2+, P r+, E—, P+ 
Note a. E.”’ and “P”’ signify Treadmill, Examination, 
and Questionnaire Probe Stress. “‘+"’ indicates the “‘on’”’ state, 
“—"' the “off” state 


* Due to differential attrition of subjects in the eight groups, 
N's were only approximately equal for these categories, varying 
between 10 and 12 for all groups except groups three and six, which 
had N’s of 8 and 14 respectively. 


The condition-response design, like analysis of 
variance, results in a rapid multiplication of the num- 
ber of categories required (treatment subgroups) as the 
number of stimuli increases. Consequently, one will 
usually be constrained to deal with far fewer stimuli 
than responses, but this suits the objective of the 
design, which is to examine directly the relation be- 
tween changes of intensity in single stimuli and changes 
in intensity of demonstrated response patterns. Of 
course, more response variables are needed to define 
the latter 

In the classical univariate design, one examines the 
significance of the difference of the dependent variable 
values corresponding to the two levels of the stimulus 
(or the two different effects). The equivalent of this 
in the present design is to examine the significance of 
the correlation of the stimulus variable with the re- 
sponse factor.* Parenthetically, the loading of the 
stimulus on the response factor, as immediately ob- 
tained by the present proposed factoring, is not identical 
with the correlation when factors are oblique, and re- 
quires further calculation; but the departure is so 
slight with the small obliquities obtained here that we 
have not considered the complex calculations justified 
and shall speak of the loading as the correlation. 

The condition-response design gives more informa- 
tion than does analysis of variance (a) in tying down 
the “dependent variable” in terms of response dimen- 
sions (in groups of tests) and also (b) because it reveals 
not only whether the relation of stimulus change and 
response change is significant, but also how large it is. 
On the other hand, it says nothing about interaction 
et.ects. However, it should be noted that even if each 
of the several stimuli were actually perfectly related 
to the response (in the absence of other variables), 
the correlation of each therewith could not be unity 


* Tt is unnecessary, of course, to take the correlations 
of the stimulus change with the changes on the variables 
in the response factor and to estimate therefrom the 
correlation with the factor behind them. Factor analysis 
gives directly the correlation with the true factor 


If they vary independently (see Table 1 and discus- 
sion), the summed squares of their correlations with 
the common-response could not exceed unity. Thus, in 
the present instance the three individual loadings could 
not exceed 1/3, ie., .58. Since our experimentally 
controlled, deliberate manipulation of the stimuli is not 
the sole variance in these stimuli, the maximum is 
actually lower than this. For example, if our imposed 
range of examination stress, between stressless periods 
of the semester and semester exams, is, on the average, 
half of the individual’s actually experienced range of 
examination stress, a full response of the anxiety 
factor in our experiment to the controlled stress would 
still yield a correlation of only about .41. These points 
should be kept in mind in our subsequent examination 
of the stimulus loadings 


RESULTS 


Replication of the Four Threat Response Pat- 
terns. According to the above design, difference 
scores on all variables between the first and 
second occasion, including the prearranged 
differences on the stimuli, were computed 
over all 86 individuals, and product-moment 
intercorrelated. A centroid factor extraction 
from the 72 X 72 matrix was terminated at 
14 factors, according to criteria of complete- 
ness of extraction described elsewhere (Cat- 
tell, 1952) and rotated to a simple structure 
which, after 25 rotations, reached a plateau 
of 59% of variables in the hyperplane. 

A separate report gives full data and general 
descriptive comments on all fourteen factors 
(Cattell & Scheier, 1958b). The four factors 
chosen for concentrated attention here as 
“threat response” factors were so chosen be- 
cause (a) their loaded response variables most 
strongly suggest the character of “‘response to 
threat”” as commonly conceived and (}) with 
the minor possible exception noted below, 
they were the only well identified factors sig- 
nificantly associated with at least one of the 
three ‘“‘stress” stimuli. Of the other ten factors, 
five had a poor or only fair identification with 
previously known personality dimensions (Cat- 
tell, 1957a; Cattell & Scheier, 1958b), and 
three, though well identified, had no signifi- 
cant stimulus association (U.I. 34, Autistic 
Nonconformity; U.I. 27, Apathy; and U.I. 35, 
Self-Reliance). In fact, only two of the ten 
factors falling outside the original conception 
of threat-response factors had reasonably good 
identification plus significant stimulus asso- 
ciation. These were factor U.I. 19, Critical 
Practicality, increased by anticipation of tread- 
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mill run, and factor U.I. 17, Inhibition, de- 
creased by examination imminence and by 
anticipation of treadmill run. If their identifi- 
cation holds up and improves, these two fac- 
tors may in the future be added to the four 
threat-response factors, in spite of the fact 
that the responses they involve are not so 
obviously “face valid” as are the four factors 
discussed here. 

The four factors which replicate previous 
patterns, considered as responses to stress in 
the generalized sense, are: (a) Anxiety, re- 
corded as P.U.I. 9(—) as a state and U.I. 24 
as a trait (Cattell, 1957a; Cattell & Scheier, 
1958a); (6b) Neurotic Debility, recorded as 
U.I. 23 as a trait (Cattell, 1957a; Cattell & 
Scheier, in press-b) and P.U.I. 8(—) as a state 
(Cattell, 1955a); (c) Frustrative Depression or 
Pathemia vs. Elation, recorded as P.U.I. 2 
Cattell, 1957a) and having resemblance to 
he Trait Corticalertia vs. Pathemia, U.I. 22, 
and (d) Stress response, P.U.I. 4. The positions 
of these factors in rank order of diminishing 
variance of the fourteen factors in this study 
were 12, 3, 2, and 11 respectively. For ease of 
reference to the total matrix (Cattell & Scheier, 
1958b), these reference numbers will be em- 
ployed here. 

In a moment we shall examine the signifi- 
cance of the relation of change on these factors 
to stimulus changes, but first let us establish 
more definitely the nature and identification 
of the four response patterns themselves. As 
usual, each pattern is set out in terms of the 
highest-loading variables with their loadings— 
actually the correlations with the reference 
vectors, which are proportional to the factor 
loadings (Cattell, 1952). For economy of pres- 
entation, the salients are cut off at a lower 
loading, one of .20, which is still well above 
the P=.01 significance level for a loading 
according to Saunders’s formula (Cattell, 
1952), which indicates a P=.01 value of 
about 0.16. Correlation coefficients, to which 
these loadings are roughly equivalent, attain 
significance at the P = .05 level, at a value of 
.21 for an N of 86 cases. 

Factor 12 clearly identifies with the Anxiety 
factor, U.I. 24 (Cattell & Scheier, 1958a), 
having in common with the established pat- 
tern such variables as the symptom check list, 
the Q4 and C(—) questionnaire factors, and 
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objective test markers’ (Table 2, Loadings 
No. 3 and No. 7). Its identification with P.U.L. 
9(—), Anxiety state or “high general auto- 
nomic activity level” (Cattell, 1955a) is made 
through the pattern of higher systolic blood 
pressure (No. 6), lower skin resistance (No. 11), 
and (at lower loading than the cut-off) more 
acid pH saliva. Loading Number 12 fits this 
latter identification only with special assump- 
tions. 

Some differences are observable between 
this pattern of ‘“‘anxiety as a state” and the 
earlier studies fixing the pattern of “‘anxiety as 
a trait” (R technique, U.I. 24). Notably, such 
U.I. 24 objective test markers as “tendency to 
agree,” “tendency to confess faults,” and 
“severity of criticism” load less here (below 
.20) than in the trait pattern, though still in 
the correct direction. The same is true of 
questionnaire factor O, or Guilt Proneness, 
and it is tentatively concluded that these are 
mainly characterological, not state, concomi- 
tants of anxiety. Conversely, such variables as 
systolic blood pressure have not appeared re- 
liably in the /ratt pattern. 

The second response pattern to be identified 

Neurotic Debility—is shown in Table 3. 
Variables (No. 2) and (No. 4) are those found 
most frequently to characterize the original 
Neurotic Debility Factor, U.I. 23(—) (Cat- 
tell & Scheier, in press-b); questionnaire fac- 
tor C(—), lower Ego Strength, also belongs, 
but the loading for the sway suggestibility 
marker is only .10, so the pattern of the state 
must again be considered as somewhat modi- 
fied from the trait, though still unmistakably 
characteristic. Less interest in reality-oriented 
love attachments (No. 1), occurring simul- 
taneously with higher desire to satisfy sexual 
needs (No. 3), is consistent with a hypothesis 
of failure to sublimate, but this must be con- 
sidered speculation pending further research. 


7 By “marker,” we mean a variable which, through 
a number of different studies, has significantly loaded 
a factor in a consistent direction (See Cattell 1955b; 
Cattell, 1957a). After extensive research has established 
markers, each factor can be identified in new data in 
terms of the presence or absence of the group of markers 
known to load on it from previous work. The markers 
referred to in the discussion will be most conveniently 
found in overview references (Cattell, 1955a; Cattell, 
Cattell, Cattell & Scheier, 1958a; 
Cattell & Scheier, in press-a). 


1955b; 1957a; 
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TABLE 2 
Factor LOADINGS ON ANXIETY* 

1 +41 More anxiety-tension symptoms (self- 
checklist) 

2. +40 Q4(+) Greater Ergic Tension 

3 +35 Less confident in assuming skill in untried 
performance 

4 32 Less long-circuited (predominance of short 
term life goals 

5. -—31 Less persistence in unrewarding situation 

6. +30 Higher systolic blood pressure 

7. 30 Lower average handwriting pressure 


8 29 C(—) Lower Ego Strength 

9 +27 Greater volume of saliva secreted 

10. —26 Less work decrement due to anxiety sound 
distraction 

11 26 Lower absolute level of galvanic skin re- 
sistance 

12 -22 Smaller pupil diameter during stress 

13 21 Lower ratio of “acquaintances” to 
“friends’’ remembered 

14. +20 More desire to satisfy sexual needs 

15 +20 More slowing of reaction time due to 


electric shock 


Note.—In Tables 2 through 5, we give loadings for all response 
variables which appeared in this study and loaded on (correlated 
with, associated with) the factor dimensions at .20 or more. Table 
6 (left-hand side) gives all stimulus associations found, no matter 
what their size 


* Factor 12: U.I. 24 ; P.U.I. 9(- 


TABLE 3 
Factor LOADINGS ON Neurotic DeBILity* 


1. -—46 Less interest (objective test) in complex, 
reality-oriented, love attachments 

2 -40 Lower ratio accuracy (number correct) 
to accomplishment (number done) on a 
variety of tasks 

3. +39 More desire (objective test) for sexual 


pleasure 
More rigidity 
ceptual) 


4. +30 classical motor and per- 


5 26 Lower ratio warned/unwarned reaction 
time 

6 26 Smaller size of myokinetic movement 

7 +24 More acceleration to competitive situations 

8 21 C(—) Lower Ego Strength 

9. +21 More errors made in complex reaction 


time runs 


10. 21 Less reluctance to enter fear situations 
11. +20 More anxiety-tension symptoms (self- 
checklist) 


12. +20 Prefer weaker (vs. stronger) smells 


*Factor 3: U.I. 23 P.U.I. &(- 

The original P.U.I. 8(—) state or change- 
hrough-time pattern from longitudinal (P 
technique) individual studies (Cattell, 1955a) 
is recognized also here as a combination of 
heightened rigidity (No. 4), smaller size of 
myokinesis (No. 6), and reduced hand tremor 
—the latter loading below the cutting point. 
It is concluded that this is the state factor 
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corresponding to the trait of Neurotic Debility, 
one of the most important factors distinguish- 
ing between clinically judged neurotics (U.I. 
23(—)) and normals (U.I. 23(+)) (Cattell & 
Scheier, in press-a). U.I. 23 is empirically inde- 
pendent of Anxiety (Cattell, 1957a; Cattell & 
Scheier, 1958b), which also significantly dis- 
criminates between neurotics and normals 
(Cattell & Scheier, in press-a). ; 

The next factor has never appeared as an 
inter-individual difference (R technique) or 
trait pattern, but has appeared in four P- 
technique studies (Cattell, 1957a, p. 651) as a 
state pattern. The matching rests on “fewer 
circles used in the cursive miniature situation 
test” (Table 4, No. 1), reduced myokinetic 
movement (No. 8), reduced rigidity (No. 9), 
and loading in the correct direction but below 
the cutting point, better memory performance 
and other “‘concentration” performances, not- 
ably low sway, and good multiplying. Cattell 
(1957a) has hypothesized that this is a stress 
pattern, distinct from anxiety or fatigue fac- 
tors, like that produced by physical stress or 
disease, but here (as frequently occurs) pro- 
duced by mental effort. 

The last response pattern, shown in Table 
5 cannot be matched with complete statistical 
confidence to the U.I. 22(—) factor (Low 
Cortical Alertness; depression and Emotional 
Immaturity), which it most nearly resembles 
among known factors. The resemblance ex- 
tends to a combination of slow reaction time, 
fewer false reactions in a reaction time test, 
greater persistence, and (loading below the 
cutting point) slower tempo, and less fidgeting 
recorded on fidgetometer. However, there are 
also several variables present here, notably 
No. 1, which have never before been found to 
load U.I. 22. Therefore, although no possible 
alternative U.I. 22 match exists in the full 
series of 14 factors in this study, we have 
contingently named this large variance factor 
“Low General Excitation Level” as well as 
“Low Cortical Alertness” or ““Pathemia” on 
the chance that these are real alternatives and 
not just different names for what is essentially 
the same functional unity. Conceivably, the 
alignment with U.I. 22 is correct, but the 
pattern of the sfate and of the érait (relatively 
permanent high vs. low Cortical Alertness) 
are more divergent than state and trait com- 
monly are. 
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TABLE 4 


Factor LOADINGS ON Errort STRESS,* OR GENERAL 
ADAPTATION SYNDROME (STAGE ONE) 


1. —36 Fewer circles used in Cursive Miniature 


Situation (CMS) 

2. -—35 0O(—), More confidence, Self-assurance 

3. +30 Overestimate length of fear periods 

4. —26 Lower systolic blood pressure 

5. +24 Greater number of threatening objects 
perceived in unstructured drawings 

6. —24 Less desire (objective test) for sexual 
pleasure 


7. +23 More persistence of behavior in unreward- 
ing situation 
8. —21 Smaller size of myokinetic movement 


9. —21 Lower rigidity (Classical motor and per- 
ceptual) 


10. —20 Lower index of carefulness 
* Factor 11: P.U.I. 4(+). 


TABLE 5 
FactorR LOADINGS ON PATHEMIA*® (FRUSTRATIVE 
DEPRESSION OR LOW CorTICAL ALERTNESS) 
1. +46 More honesty in admitting common 
frailties 


2. +43 High fear (accident-disease) 

3. —43 Smaller amount of GSR conditioning 

4. -—37 Relatively less fluent about self than 
about others 

5. —34 Fewer errors in complex reaction time 

6. +34 More slow (irregularly warned) reaction 
time 

7. —34 Smaller magnitude of hand tremor 

8. —33 Less involuntary muscle tension in 


right arm 

9. +31 More persistence in unrewarding situation 

10. +30 More recovery of GSR level after relaxa- 
tion from shock 

+28 More desire for self control 

12. +27 More threatening objects perceived in 
unstructured situation 

13. +27 Greater volume of saliva secreted 


14. —26 Fewer circles used in Cursive Miniature 
Situation test 

15. —26 Less desire (objective test) for sexual 
pleasure 


* Factor 2: P.U.I. 2; U.I 


i) 
i= 
| 


The Relation of Response Patterns to Stimulus 
Conditions. Correlations between the factors 
(actually, reference vectors) representing the 
response patterns are given in the right hand 
section of Table 6. The left hand section of 
Table 6 gives relations between each of the 
three stimuli and the four threat-response 
factors. 

These results may be evaluated in terms of 
(a) the significance of a correlation on 86 
cases, which must be .21 or above for signifi- 
cance at the P = .05 level, or (0) the signifi- 
cance of a loading, which is taken here as .16. 


TABLE 6 


CORRELATIONS GF RESPONSE FACTORS WITH STIMULI 
AND WITH ONE ANOTHER 

Antici- 

_ pation 

Probing of 

Exam Ques- Tread- 
Immi- __tion- mill ULL. ULI P.U.I. 
nence naire Run 24(+) (23)(—) 4(+) 


U.I. 24(+) Anx- —25 00 +13 


iety 
U.I. 23(—) Neu- +04 +06 —35 +26 
rotic Debility 
P.U.I1. 4(+) +16 +28 —16 —12 +09 


Stress or Gen- 

eral Adapta- 

tion Syndrome 

U.L 22(—) +20 +02 —07 +11 —04 +15 

Lower Corti- 

cal Alertness 

or Excitation 

Level Lower 


We shall summarily depend on Criterion @ to 
be on the safe side. On this criterion, it becomes 
evident, first, that no single response factor be- 
haves in the same way to all three threat situa- 
tions; that is, no response dimension, as a 
“dependent variable,” is significantly affected 
in the same direction by all three “stress” 
stimuli. There is not even a consistency in 
direction of sign for all three stress stimuli on 
a given factor. Secondly, two of the three 
highest relations are in the opposite direction 
to that which naive views might hypothesize. 
Thus, the anticipation of a treadmill run sig- 
nificantly reduces Neurotic Debility and the 
incidence of examinations significantly reduces 
Anxiety. However, presence of probing ques- 
tionnaire threat is significantly associated with 
increased Effort Stress Reaction, P.U.I. 4(+), 
as would commonly be expected. 


DISCUSSION 


Although it is a possibility that future re- 
search will discover “‘stressful” stimuli which 
do act on the Anxiety factor in the naively 
expected direction, the fact we have to assimi- 
late here is that the expectation can be quite 
wrong. Experimenters have assumed anxiety 
to arise from stress stimuli without regard to 
the clinically common observation that a per- 
sonality beset by vague fears and anxieties 
often loses these anxieties for a while when a 
real danger threatens. For example, in dis- 
cussing the case of Helen, May (1950, p. 332) 
states: 
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So long as her apprehension could be attached 
to the possible sufferings in childbirth—a focus of 
fear which could easily be construed by Helen as 
‘rational’—she was not faced with the much more 
difficult problem of confronting the underlying guilt 
feelings, even the admission of which would have 
threatened her central psychological protective 
strategies and thrown her into profound conflict. 
Incidence of examinations may be a pecul- 

iarly efficacious “focus of fear’’ in furnishing a 
pretext for the excessive busyness which is a 
commonly accepted means of allaying anxiety 
(May, 1950, p. 172). This phenomenon is not 
necessarily merely a switch from free floating 
anxiety to a delimited phobia; a more funda- 
mental process may be involved, which is 
beyond the space of the present article to dis- 
cuss. On this point we shall content ourselves 
with a simply stated hypothesis, arising from 
the present research and susceptible to check- 
ing by a similarly designed study: namely, 
level on the anxiety state factor (P.U.I. 9%—}) 
declines when an external challenging situation, 
demanding and permitting action, actually ma- 
terializes after a long period of anticipation. In 
fact, the relation of anxiety to alleged anxiety- 
stimulating temporally much 
more complex than has been assumed in vali- 


situations is 


dating a priori anxiety tests. Thus, here the 
impact of an actual examination seems to re- 
duce anxiety, while elsewhere (Cattell & 
Scheier, in press-a) we have shown that news 
of failure in an examination (where nothing 
can be done immediately) significantly raises 
the level on this same defined anxiety factor. 

Anticipation of the treadmill run, though it 
slightly (barely significantly) raises Anxiety, 
acts powerfully to reduce the state we have 
called Neurotic Debility (see Table 6). The 
hypothesis that can best be formulated from 
the present findings is that challenge to effort, 
with minimal involvement of major sentiment 
systems and their attendant may 
slightly raise anxiety, but stimulates the capacity 
to mobilize (reduces level on the factor of Neurotic 
Debdility (U1. 23|—]). The implication favor- 
ing therapy involving shock or mild “stress” 
may be better discussed after the significant 
finding here is verified with other stimuli rele- 


anxieties, 


vant to the hypothesis. 

Although the results for the effort stress 
factor (P.U.I. 4) are not all sufficiently sig- 
nificant, it comes nearest to being confirmed 
as a general, constant, immediate response-to- 


threat dimension. In fact, if we recall that the 
positively associated examination incidence 
and probing questionnaire “threats” are 
stimuli to which response is already substan- 
tially in action at the time of the testing, 
whereas the treadmill is only something that 
is going to happen, we may tentatively hy- 
pothesize that the Effort Stress reaction pattern, 
P.UJI. 4+), appears typically in immediate 
response to a threat situation demanding effort. 
This hypothesis predicts that treadmill stress 
would have loaded positively on P.U.I. 4 if 
our response measurements had been made 
during the treadmill run or closer in time to it. 

To justify our emerging hypothesis about 
the fourth of our state factors requires colla- 
tion from further afield than present space 
permits. For various term 
“Pathemia” has been applied to this state 
(U.I. 22(—), P.U.I. 2), describing a state of 
“depression-with-emotionality” and reduced 
cognitive clarity characteristically resulting 
from certain kinds of frustration. In the pres- 
ent experiment, the actual examination situa- 
tion produced an increase in this pathemic 
state, but neither of the other stimuli did. We 
hypothesize that the pathemic response is a 
depressive response to a frustrating situation 
too complex for pugnacity, the depression char- 
acteristically showing regressive emotionality 
along with the depression. 

In general the results support the methodo- 
logical argument set out elsewhere (Cattell & 
Scheier, in press-a) regarding the study of 
states and moods. The first, necessary step is 
to recognize the chief independent response 
patterns. For what is to be called a stressful, 
anxiety-provoking, depressing, etc., stimulus, 
has to be decided by its demonstrable connec- 
tion with such reactions, not by the experi- 
menter’s fiat. Second, the psychological mean- 
ing of any stimulus can be defined by its profile 
of correlations with these independent re- 
sponses, as in Table 6. Here, this is a statement 
for the general population, but individuals 
characteristically show unique combinations 
of the common factor responses to any given 
stimulus. 

At the same time, the labels we decide to 
give to the discovered response patterns must, 
for semantic reasons, fit the popular adjective 
applied to the majority of situations by which 
the response patterns are triggered. Thus, the 
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label “stress response pattern” is properly at- 
tached to P.U.I. 4, partly because this re- 
sponse turns out to be most connected with 
stress situations per se, but also partly because 
the responses loading the pattern are those 
theoretically to be expected in an organism in 
stress. Once it becomes so labelled in general 
psychology, the right of any stimulus to be 
named “stress” must henceforth depend on its 
being a significant provoker of ihis particular 
pattern of response. Actually, the central fea- 
ture in this stress turns out to be a demand for 
powerful effort, whereas, popularly, “stress” 
has been used in the title of articles dealing 
with anxiety-provoking or simply unpleasant 
or threatening, flight-evoking stimuli. Simi- 
larly, the identity of the Anxiety Factor is de- 
fined and given meaning originally through the 
introspected anxiety in many of the responses 
found loaded in the pattern. However, this 
meaning is enriched by the significant rela- 
tions to various stimuli, e.g., examination 
threat, etc., even though the direction of the 
relation requires the hypothesis of a peculiarity 
of personality dynamics. 

If these responses have now been defined as 
recognizably unique response loading patterns, 
we can henceforth unambiguously designate 
stress stimuli as those which provoke the 
stress response factor (P.U.I. 4) and which, 
according to our hypothesis, have the quality 
of evoking a great deal of physical and mental 
effort. Anxiety stimuli will be those which pro- 
duce changes in the U.I. 24 response pattern 
or, by our hypothesis, seem to affect or 
threaten the individual’s massive life purposes 
(sentiment systems). Mobilization stimuli will 
be those which primarily determine changes 
along the Neural Reserves or U.I. 23 dimen- 
sion; by our hypothesis, such stimuli challenge 
and mobilize rather than provoke anxiety or 


stress. 
SUMMARY 


1. A condition-response factor analysis has 
been applied to three stimulus situations and 
69 response variables on 86 young adult males, 
revealing fourteen response patterns of which 
four are considered to have most importance 
as threat-response characteristics. 

2. These four “state” response factors found 
among increments of response are recognizably 
similar (a) to three factors repeatedly found by 


R technique or “trait” study of absolute 
levels; namely anxiety (U.I. 24(+-)), Neurotic 
Debility (U.I. 23(—)), and Pathemia or Low 
Cortical Alertness (U.I. 22(—)) and (0) to 
factors replicated in P-technique, namely, 
Anxiety state, P.U.I. 9(—), Pathemia, P.U.I. 
2, Neurotic Debility, P.U.I. 8(—), and stress 
or General Adaptation Syndrome (P.U_I. 
4(+)). 

3. Significant positive relations are found 
(a) between decline on the Anxiety factor level 
and being in an examination “stress” period; 
(6) between low level on the Neurotic Debility 
factor and the anticipation of an all-out tread- 
mill run; (c) between higher level on the Stress 
factor and undergoing the probing question- 
naire and examination imminence “‘stresses’’; 
and (d) between examination imminence and 
lowered cortical alertness (Pathemia or Frus- 
trative Depression). Specific testable hypothe- 
ses are proposed to the effect that (a) response 
to real danger may reduce anxiety, (0) chal- 
lenge may reduce Neurotic Debility, and (c) 
nocive, unpleasant, or demanding stimuli, 
when operating so as to require active response 
at the time of measurement, may produce 
stress-factor response. 

4. In the interests of avoiding problems 
created only by terminology, it is suggested 
that the whole class of unpleasant, threaten- 
ing, or effort-demanding stimuli sometimes 
called “stress” be more precisely defined, dif- 
ferentiated, and labeled in terms of their load- 
ings on response dimensions, specifically along 
the Anxiety dimension (U.I. 24{+]), the 
Effort Stress dimension (P.U.I. 4[+]), the 
Neurotic Debility dimension (U.I. 23[—]), or 
the Pathemia dimension (U.I. 22[—]). 

Thus, as part of a general taxonomy of situa- 
tions, it is proposed that what have loosely 
been called stress stimuli should be classified 
according to their discovered profiles of loading 
on these four (and possibly two or three yet 
unconfirmed) response factors. Individuality 
of emotional response to the same stimulus 
could similarly yield precise quantitative state- 
ment in terms of unique factor profiles. 


““ 
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INTERFERING AND FACILITATING EFFECTS OF TEST ANXIETY! 


BRITTON K. RUEBUSH 
Yale University 


NTERFERING effects of test anxiety in chil- 
dren were observed in an experimental 
learning study reported by Waite, 
Sarason, Lighthall, & Davidson (1958). In 
another study, Lighthall, Ruebush, Sarason, 
& Zweibelson (1959) found that while low 
test anxious (LA) children improved more 
than high test anxious (HA) children over a 
two-year time span on the “test-like” Otis 
Beta Intelligence Test, the HA improved more 
over the same period on the “game-like” 
Davis-Eells Test of Problem-solving Ability. 
These findings suggest that although test 
anxiety has interfering effects upon intellectual 
performance under some conditions, it may 
have facilitating effects under other conditions. 
The present study was designed as a test of 
several predictions derived from a simple theo- 
retical model, formulated to account for the 
fact that test anxiety may have both inter- 
fering and facilitating effects upon intellectual 
performance. In general, the model holds that 
the effect of anxiety upon performance is 
mediated primarily by defensive reactions to 
the anxiety. This hypothesis is supported by 
the results of a study in which it was found 
that the Rorschach performance of HA chil- 
dren was significantly affected by their cautious 
and dependent behavior in the test situation 
(Sarason, Davidson, Lighthall, & Waite, 
1958b). In particular, the model states that 
(a) defensive reactions such as cautiousness 
have been strongly overlearned and thus are 
stable “automatic” responses to the anxiety 
triggered off by such situations such as tests, 
(b) the effects of anxiety upon performance are 
mediated by these defensive reactions to the 
anxiety, (c) in problem solving situations 
where such defensive reactions are an asset, 
anxiety has a facilitating effect upon perform- 
1 This study is part of a project of which Seymour B. 
Sarason is the principal investigator and which is being 
supported by a grant from the United States Public 
Health Service (M-712). The author wishes to express 
his deep appreciation to Seymour Sarason, Kenneth 
Davidson, Frederick Lighthall, and Richard Waite 
for their interest and helpful suggestions in the design 
and execution of this investigation. 


ance, and (d) in problem solving situations 
where such defensive reactions are a liability, 
anxiety has an interfering effect upon per- 
formance. 

The criterion task selected for the present 
study provided a problem solving situation in 
which cautious behavior was expected to 
facilitate performance. Therefore, three spe- 
cific predictions were made: (a) Highly cau- 
tious Ss do better than low cautious Ss on the 
criterion task; (6) HA Ss obtain higher cau- 
tiousness scores than LA Ss; and (c) HA Ss 
perform more effectively on the criterion task 
than LA Ss. 


METHOD 


Subjects and Research Design 


The population from which the children in this 
study were selected was composed of 280 sixth-grade 
boys from the 11 elementary schools in the predom- 
inantly middle-class community of Hamden, Connecti- 
cut. Within a period of approximately two months 
preceding the selection of the final sample, the entire 
sampling population received the Yale Test Anxiety 
and Yale General Anxiety Scales, as well as the Beta 
Test Form Em of the Otis Mental Ability Tests.? Pre- 
vious reports have described the Yale anxiety scales 
and given evidence pertaining to their validity (Sarason, 
Davidson, Lighthall, & Waite, 1958a; Sarason et al., 
1958b; Sarnoff, Lighthall, Waite, Davidson, & Sarason, 
1958; Waite et al., 1958; Zweibelson, 1956). As the 
first step in selecting the final sample, Ss whose test 
anxiety scores were <5 were designated as LA, and 
those whose test anxiety scores were >15 were termed 
HA. These cutoff points included approximately the 
lower and upper quartiles of the total sampling popu- 
lation. Second, each of these LA and HA Ss whose 
Otis Beta IQ score fell within the interval 89-122 was 
then assigned to one of three levels of intelligence: 
89-100, 101-110, or 111-122, designated as low, 
medium, and high levels, respectively. Third, eight Ss 
were selected by a table of random numbers from each 
of the six anxiety-IQ groups. Fourth, each of the 
groups of eight Ss was randomly split into two groups 
of four, one of which was to receive the easy criterion 
task items first, the other to receive the difficult items 
first. To sum up, the research design of the present 
study was a 2 (anxiety levels) X 3 (intelligence levels) 
X 2 (order of administration of easy and difficult items) 
mixed factorial arrangement, with four Ss in each cell. 





2 The author wishes to thank Kenneth Davidson 
and Jane Maltby for their help in collecting these data. 
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The within-Ss effect consisted of a main effect of trials 
(item difficulty) plus the interaction of trials with the 


between-Ss effects (anxiety, intelligence, and order) 


Criterion Task 


The criterion task used in this study consisted of 
the 12-item short form of the Witkin Embedded Figures 
Test developed by Jackson (1956). Diagrams and de- 
tailed information about the figures are given by 
Witkin (1950). Previous studies have demonstrated 
that the criterion task is well within the intellectual 
grasp of the population used in the present study 
Witkin, Lewis, Hertzman, Machover, Meissner, 
& Wafner, 1954). The task items were ranked in order 
of difficulty according to the mean solution time 
achieved on each item by an independent sample of 26 
13-year-old-boys.’ Ranked from easiest to most difficult 
and using Witkin’s (1950) numbering system, the rank 
order of difficulty of the items was as follows (mean 
solution times in parentheses): D-2 (19 sec.); G-1 (23); 
A-3 (25); C-2 (36); E-3 (58); E-5 (60); C-3 (67); H-1 
72); D-1 (79); C-1 (102); A-2 (104); and E-1 (105). 


Procedure 


E, who did not know the anxiety or IQ level of the 
Ss, worked with them individually in the public schools. 
After bringing each S from the classroom to the testing 
room, a brief conversation took place during which the 
establishment of good rapport was attempted. Then 
the following instructions were given 

This is like the game in the Sunday papers where 
you have to find a hidden animal in a picture. Only 
this time you won’t be looking for an animal. This is 

a test to see how fast you can find a design 

E then gave S the practice complex figure for 15 
sec., during which he asked S whether it reminded him 
of anything. The complex figure was then removed, 
and S was given the simple practice figure for 10 sec., 
during which EZ advised S to “learn this one carefully” 
because he would have to locate it in the complicated 
figure he had just seen. Finally, Z removed the simple 
figure and presented the complex figure again, at the 
same time telling S to find the simple figure in the 
complicated one. After S had located the practice 
embedded figure, he was asked to trace its outline with 
a blunt instrument so that E could see that S under- 
stood the task. Then the instructions continued as 
follows 

This is the way the test will be done. You will be 
shown the complicated design and then the little 
simple figure. Finally, you will have to locate the 
simple figure in the complicated one just as you did. 

The simple design will always be present in the com- 

plicated one, and it will always be right side up. 
Also, if you forget the simple figure, you may ask to 
see it again as many times as you wish and it won’t 
count against you since I stop the watch while 
you're looking at the simple figure. And remember, 
there may be some more lines crossing the simple 
figure when it is in the complicated figure. Find the 


* The author is grateful to H. A. Witkin for making 
these data available 


simple figure as fast as you can, since I will be timing 
you and your grade depends on your time, but be 
sure that the figure you find has exactly the same 
size and shape as the one you're supposed to be 
looking for. As soon as you think you have found the 
figure, tell me at once so that I can note your time 
and then show me where it is by tracing its outline 
If you are right, I will give you the time when you 
said you saw it. If you are wrong, you will just go 
on looking and your guess won’t count against you. 
So remember, tell me as soon as you have found the 
figure, and then show me where it is. Any questions? 
E then presented the items in one of two orders, 
depending upon the order group to which S had been 
randomly assigned. If S had been assigned to order 
Easy-Difficult (ED), he was given the figures in the 
order, 1-12. If he had been assigned to order Difficult- 
Easy (DE), the order of administration was reversed— 
S got the figures in order 12-1. 
Each time S requested to see the simple figure again, 
E stopped the watch and recorded the time and a figure 
“S.” The watch was not stopped during a guess, al- 
though when S made a guess the time was recorded, 
and if the guess was incorrect a “G” was also recorded 
after the time. If .S had not located the embedded figure 
within five minutes, this was noted on the recording 
sheet, and S was then shown the correct location of the 
embedded figure 


Scoring 


Latency. The time in seconds taken by each S to 
solve each figure was recorded and then transformed to 
logarithms in order to achieve normally distributed 
item distributions and homogeneity of the variances 
of the 12 groups. The rationale for this use of 
logarithmic transformations has been presented by 
Quenouille (1950). Normality of the item distributions 
was ascertained by observation and the group variances 
were found to be homogeneous according to the Fimax 
test (Walker & Lev, 1953). The logarithms of each S’s 
latency scores on figures D-2 and G-1, A-3 and C-2, 
E-3 and E-5, C-3 and H-1, D-1 and C-1, A-2 and 
E-1, respectively, were summed so that, for purposes of 
the analysis, each S had 6 log-latency scores. In addi 
tion, a total log-latency score was obtained for each S 
by summing his 12 log-latency scores 

Number of figures solved. Each S was also given a 
score consisting of the total number of figures solved 
without regard for the time taken to solve them. Ob- 
servation of the distribution and homogeneity of 
variances of these scores revealed no consequential 
deviation from normality, and thus no need for a 
transformation of scores 

Cautiousness Index. The instructions preceding 
administration of the criterion task included statements 
that S could request to see the original (simple) figure 
as often as he liked and without penalty (he was told 
the watch would be stopped while he looked). In 
addition, he was told that guessing was permitted and 
wouldn’t count against him, although he was not told 
that the watch would be stopped during a guess. He 
was also told that if he made a “wrong”’ guess, he would 
be so informed and could then continue his search for 
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the embedded figure. Thus, the task was structured so 
that a cautious approach in which S made sure that he 
knew the correct response before he made a guess would 
be rewarded by success (since the watch continued 
running during guesses). A ratio consisting of the 
number of requests to see the simple figure divided by 
the number of “wrong” guesses was computed for each 
S. This ratio is relatively independent of latency be- 
cause it was possible for S to obtain high or low ratios 
regardless of the amount of time he took to solve the 
figures. The S who obtains a high ratio score is one who 
asks to see the simple figure many times before he 
ventures a guess, and although there are elements of 
both dependency and cautiousness involvec' in this 
sort of behavior, the ratio score has been labeled the 
Cautiousness Index, because this term seems to pro- 
vide the most accurate description ot the variable 
under study. Since these ratio scores were not normally 
distributed, the logarithmic transformation was again 
utilized successfully to normalize the distribution so 
that / ratios could be computed. 


RESULTS 
Analysis of Latency Scores* 


The results of the analysis of variance of the 
log-latency scores are given in Table 1. Al- 
though the main effects of both Anxiety and 
linear IQ only approached statistical signifi- 
cance, the Anxiety X linear IQ interaction 
was found to be statistically significant 
(p < .05). This interaction resulted from the 
fact that, in comparison with the LA Ss, the 
performance of the HA Ss was superior at the 
low and middle levels of IQ and inferior at the 
high IQ level. As a statistical check on this 
observation, separate t tests were computed 
for the differences between the mean log- 
latency scores of the HA and LA groups at 
each level of IQ. The results revealed that the 
HA performed significantly better than the 
LA at the low IQ level ( < .01) and at the 
middle IQ level (p < .03), whereas the su- 
perior performance of the LA at the high IQ 
level only approached statistical significance 
(.10 < p < .20). The mean log-latency scores 
for the HA and LA groups at each level of IQ, 
as well as the results of the / tests of the dif- 
ferences between these means, are given in 
Table 2. 

With respect to the within-Ss effects listed 
in Table 1, the main effect of Trials and the 
Trials X Order interaction were both found 
to be statistically significant (p < .001) and 

‘ The author wishes to express his thanks to Norman 
Anderson for his helpful advice on the statistical 
analysis 


TABLE 1 
ANALYSIS OF VARIANCE OF LOG-LATENCY SCORES 
SS df MS F P 
Anxiety 3.146 1 3.15 2.23 p< .i5 
10 4.781 2 2.39 
Linear 4.624 1 4.624 3.28 p< .10 
Resid 0.157 1 0.157 
Order 0.005 1 0.005 
Ax IQ 6.843 2 3.42 2.43 p< .10 
Linear 6.388 1 6.39 4.53 p< .05 
Resid 0.455 1 0.455 
Axo 2.024 1 2.024 
6 X IQ 2.005 2 1.002 
AxoOx IQ 1.767 2 0.884 
Error 50.791 36 1.41 
Within Trials 51.096 5 10.219 14.45 p< .001 
(task diffi- 
culty) 
TXA 0.765 5 0.153 
Txo 15.847 5 3.169 4.48 p< .001 
TX IQ 2.542 10 0.254 
TXAxO 2.876 5 0.575 
TXAxXIQ 3.279 10 0.328 
TxOxIQ 3.858 10 0.386 
TxXAxO 2.635 10 0.264 
x 1Q 
Error 127.242 180 0.707 
TABLE 2 
MEAN LOG-LATENCY SCORES FOR ANXIETY—IQ Groups 
HA LA SEDm i df ?” 
LIQ 22.29 25.39 0.79 3.92 14 <.01 
M IQ 21.65 23.58 0.75 2.57 14 <.03 
H IQ 22.61 21.34 0.88 1.44 14 


.10< p<. 


* Al 


significance tests two-tailed 


to account for the major portion of the 
within-Ss variance. The Trials X Order inter- 
action resulted from the fact that both the 
LA and the HA Ss did better on the items 
which came last in sequence. The Trials effect 
showed that the criterion task items did differ 
in level of difficulty. Duncan’s Test (Snedecor, 
1956) was carried out on the six over-all mean 
log-latency scores for Trials in order to deter- 
mine more precisely the extent of the difference 
in task difficulty level of the criterion task 
items. Since there was no Anxiety X Trials 
interaction, the overall MS within was used in 
computing SE,, for Duncan’s Test. The results 
of the 14 simultaneous comparisons indicated 
that, with the exception of the comparison of 
the first pair (D-2 and G-1) with the second 
pair (A-3 and C-2), each of the six pairs of 
figures differed significantly in task difficulty 
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TABLE 3 
MEAN NuMBERS OF FiGurRES SOLVED FoR ANXIETY— 
IQ Groups 





HA LA SEDm ‘ df »*° 

LIQ 8.88 6.38 0.71 3.52 14 <.01 
M IQ 9.50 7.88 0.63 2.57 14 <.05 
H IQ 8.62 8.75 0.77 0.17 14 NS 
7.67 0.43 3.09 46 <.01 


Total 9.00 


* All significance tests two-tailed 


from every other pair according to Duncan’s 
Test with the 99% protection level. 


Analysis of Number Solved 

The ¢ testing technique was utilized for 
testing the difference between the mean num- 
ber of figures solved by the two anxiety groups, 
as well as for testing the differences between 
the means of these groups at each IQ level. 
The results of the / tests are given in Table 3. 
They show that the HA solved significantly 
more figures than the LA (p < .01), although, 
as in the analysis of the latency scores, the 
tests of the differences between the means at 
each IQ level show that the superior perform- 
ance of the HA was limited to the low and 
middle levels of IQ. In addition, and in con- 
trast with the results of the analysis of the 
log-latency scores, the superiority of the LA 
Ss at the high IQ level in the present analysis 
was not found to approach statistical signifi- 


cance. 


Multivariate Analysis of Log-latency and Num- 

ber Solved 

A statistical procedure given by Rao (1952) 
was used in carrying out a simultaneous multi- 
variate analysis of both log latency and num- 
ber of figures solved. The only significant 
effects to result from this analysis were the 
main effect of Anxiety (p < .05) and the 
Order X IQ interaction (p < .05). That is, 
when both variables are allowed to operate 
simultaneously in the same analysis, the 
Anxiety X IQ interaction drops out, and the 
main effect of Anxiety becomes significant, 
with the HA showing superior over-all per- 
formance. It is apparent that when these two 
variables are combined in an analysis of this 
type, the marked superiority of the HA Ss at 
the low and middle IQ levels overrides the 
Anxiety X IQ interaction found in the analysis 


of variance of the log-latency scores. The 
Order X IQ interaction resulted from the fact 
that the high and low IQ groups performed 
more efficiently when they were given the easy 
figures first, whereas the middle IQ groups 
performed more efficiently when given the 
more difficult figures first. 


Analysis of Cautiousness Index (CI) 


After collapsing the groups on IQ, the mean 
log CI score for the 24 HA Ss was .174, whereas 
the mean log CI score for the 24 LA Ss was 
.056. The SE of the difference between these 
means was .054, and the ¢ ratio was 2.19 with 
46 degrees of freedom. This ratio is significant 
at the p < .02 level of significance (one-tail). 
The LA mean log CI score was raised con- 
siderably by the fact that one LA S obtained 
a log CI score of 1.061. If the highly deviant 
score of this LA S had been excluded, the 
mean log CI score for the LA group would 
have dropped to .011. In any event, it remains 
clear that the HA group was significantly more 
cautious than the LA group. 

The groups were also collapsed on Anxiety 
in order to determine whether the three IQ 
groups also differed in degree of cautiousness. 
The mean log CI score for the 16 high IQ Ss 
was .101; for the 16 middle IQ Ss, .123; and 
for the 16 low IQ Ss, .121. Since the differences 
between these means were not found even to 
approach statistical significance, it was con- 
cluded that the three IQ groups did not differ 
in degree of cautiousness. 

For purposes of further analysis and com- 
parison, it was decided to define the highly 
cautious S (whether HA or LA) as one with 
a raw S-over-G ratio’ >2 (log S-over-G ratio 
> .301) because there was a break in the total 
distribution of cautiousness scores between the 
17 Ss with raw S-over-G ratios >2 and the 
31 Ss with S-over-G ratios <2. According to 
this criterion of high and low cautiousness, 
there were 11 HA Ss and 6 LA Ss who were 
defined as being highly cautious. The mean 
log-latency scores, the mean number of figures 
solved, and the mean log CI scores of the high 
and low cautious Ss—both within and across 
Anxiety groups—are given in Table 4. It is ap- 
parent that the HA highly cautious Ss solved 
more figures (p < .001) in less time (p < .001) 

5 This ratio is defined in the section describing the 
scoring procedure. 
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TABLE 4 
————_—_—_ 
Lee» |No. Solved cI 
HA _— Cautious | 20.928 | 10.27 | 0.493 
Not Cautious | 23.244 | 7.92 |—0.095 
Mean Diff. 2.316*| 2.35*| 0.588" 
LA Cautious | 21.156 | 9.67 | 0.512 
Not Cautious | 24.197 | 7.00 |—0.097 
Mean Diff. | 3.041% 2.67*| 0.609* 
Total Cautious | 21.042 | 10.06 | 0.502 
Not Cautious | 23.721 | 7.39 |—0.096 
Mean Diff. | 2.679*| 2.67* | 0.598* 





* Significant at the p < .001 level (two-tailed). 


TABLE 5 


Mean GA anv TA Scores oF HicH anp Low TA, 
Cautious AND Noncavtious Ss 








HA 











Cautious Noncautious Diff. 
TA 18.2 19.8 1.6 
GA 33.7 16.5 0.8 
LA 
TA 3.0 2.6 0.4 
GA 10.0 6.3 3.7° 





* Significant p at < .01 level (22 df) 

than the HA low cautious Ss. Similarly, the 
LA highly cautious Ss also solved more figures 
(p < .001), in less time (p < .001) the than 
LA low cautious Ss. Furthermore, inspection 
of the data in Table 4 reveals that the per- 
formance of the 11 HA highly cautious Ss did 
not differ appreciably from the performance 
of the 6 LA highly cautious Ss, nor did the 
performance of the 13 HA low cautious Ss 
differ appreciably from that of the 18 LA low 
cautious Ss. Finally, when the groups were 
collapsed on Anxiety and IQ, the highly cau- 
tious Ss were found to solve more figures 
(p < .001) in less time (p < .001) than the 
low cautious Ss. 

Since General Anxiety (GA) scores from the 
Yale General Anxiety Scale for Children were 
also available for all of these Ss, the mean GA 
scores as well as the mean TA scores for the 
highly cautious and low cautious Ss within 
TA groups were computed in an attempt to 
shed some light upon the fact that, contrary 
to expectations, six of the low TA Ss were also 
highly cautious. T tests revealed that the mean 


GA score for the six highly cautious low TA 
Ss was significantly higher than the mean GA 
score for the 18 low cautious low TA Ss 
(p < .01), even though their TA scores were 
not significantly different. On the other hand, 
the mean GA and TA scores of the highly 
cautious and low cautious high TA Ss were 
not significantly different. These means are 
listed in Table 5. 


DISCUSSION 


The prediction that highly cautious Ss do 
better than low cautious Ss on the criterion 
task is confirmed by the findings. This sup- 
ports the original analysis of the embedded 
figures task as one in which cautious behavior 
is an asset in problem solving. 

The prediction that HA Ss obtain signifi- 
cantly higher cautiousness scores than LA Ss 
is also confirmed, consistent with the thesis 
that (a) in the case of the test anxious S, the 
evaluative characteristic of tests has become 
connected with certain unconscious material 
involving his early interpersonal relationships 
and conflicts with important authority figures; 
(6) as a consequence, tests have become 
equated with danger and thus trigger his 
anxiety; (c) the test anxious child gradually 
has learned certain methods of defending 
against the anxiety engendered in him by 
evaluative situations; (d) these defenses include 
the type of behavior here labeled ‘‘cautious- 
ness”; and (e) because these defenses have 
been strongly overlearned, they have become 
“automatic” responses to the anxiety triggered 
off by tests. 

The prediction that the performance of HA 
Ss on the criterion task would be significantly 
better than that of the LA Ss is confirmed by 
the results of the multivariate analysis. How- 
ever, despite the fact that this analysis ac- 
counted for both the control and criterion 
variables simultaneously, more detailed analy- 
sis revealed that the superiority of the HA Ss 
was limited to the low and medium level of 
IQ. For these IQ groups, then, wherever the 
tasks in a test situation are defined or struc- 
tured so that cautiousness, as here defined, is 
a positive asset in solving the tasks; the test 
anxious child will do well. It is in this sense 
that the results show that anxiety is not al- 
ways disorganizing, but that under certain 
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conditions it may be facilitating. It is im- 
portant to emphasize, however, that the su- 
perior performance expected of the HA child 
in some situations is a “fortuitous” result of 
defensive reactions to his anxiety over which 
he has little conscious control, and thus does 
not at all imply that he will do well in all prob- 
lem solving areas. That is, in problem solving 
areas where his stable defensive reactions con- 
flict with the type of behavior required for 
solution of the problem, they are expected to 
function as a liability rather than as an asset. 

The results of the analysis of the cautious- 
ness scores suggest that cautiousness was in- 
deed the defensive reaction to anxiety which 
served as a mediating variable between anxiety 
and performance on the criterion task. It is 
relevant that the 6 Ss who were low on TA 
and highly cautious had significantly higher 
GA scores than the 18 Ss who were low on TA 
and less cautious. This seems to constitute 
further support for the hypothesized strong 
relationship between anxiety and cautiousness, 
since in view of their higher GA scores, it is 
more likely than not that the six low TA, 
highly cautious Ss may also be anxious about 
tests, although they did not admit it on the 
TA questionnaire. Sarason et al. (1958b) dis- 
cussed the problem of differentiating among 
Ss who are really not anxious about tests and 
those who are test anxious, but who cannot or 
refuse to admit their anxiety. It is also relevant 
to the present discussion that in a Rorschach 
study (Sarason et al., 1958b), it was found 
that the HA asked more questions of the 
examiner and rejected more cards than did the 
LA. In other words, their response to the test 
situation was to become dependent or cau- 
tious. However, this kind of behavior was not 
rewarded in the Rorschach situation. S got no 
help from the examiner and no reward for 
cautious behavior in terms of success and 
failure. In the Embedded Figures situation 
used in the present study, on the other hand, 
cautious behavior was rewarding because the 
cautious S could ask to see the simple figure 
again without incurring the disapproval of the 
examiner (and stop the watch in so doing) and 
could easily tell when his behavior resulted in 
success. Thus, with these particular instruc- 
tions and this type of task, the cautious be- 
havior of HA Ss facilitates their performance. 
In summary, the findings of the present study 


agree with certain past findings which support 
the generalization that the effect of anxiety on 
performance, whether facilitating or inter- 
fering, is mediated primarily by defensive re- 
actions to the anxiety. It follows that two 
research problems of major importance are (a) 
the detection and detailed theoretical and ex- 
perimental analysis of common defensive reac- 
tions to anxiety and (}) the detection and 
detailed theoretical and experimental analysis 
of the type of response requisite to success in 
a wide variety of task-instruction situations. 
Such research would improve our ability to 
predict the problem-solving behavior of chil- 
dren with different personality structures in 
various types of problem-solving situations. 

The significant interaction between Anxiety 
and IQ yielded by the analysis of variance of 
the log-latency scores was not predicted in ad- 
vance. The interaction resulted from the fact 
that the performance of the HA was superior 
to that of the LA at the low and middle levels 
of IQ and inferior at the high IQ level. One 
tentative explanation is that the LA Ss who 
were highly intelligent were able to perceive 
that this situation called for a specific type of 
approach and performed accordingly, whereas 
the less intelligent LA Ss were not this ana- 
lytic. To the extent that this is valid, the low 
and medium IQ HA Ss, in contrast to their 
LA counterparts in intelligence, have an initial 
advantage in a situation in which cautious 
behavior is an asset because they characteris- 
tically tend to react with caution in all evalua- 
tive situations in order to defend against their 
anxiety. 

It is well known that the behavior of two 
Ss may appear to be the same in a specific 
situation although the behavior is a neurotic 
symptom in one case and not in the other. 
Just as close observation of such cases over 
time enables us to differentiate the neurotic 
behavior from that which is more flexible, it is 
possible that the type of cautiousness utilized 
by the LA highly intelligent group may be 
qualitatively different from that which charac- 
terizes the behavior of the HA group. That is, 
whereas cautiousness as an anxiety reaction 
may dominate all aspects of the response to 
the anxiety-arousing situation, regardless of 
appropriateness, the LA child may be able to 
limit this type of behavior to those situations 
which he can recognize as requiring this type 
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of response; and, in contrast with HA Ss at all 
levels of IQ, he may also be able to combine 
speed of response with his cautious approach. 
In any event, this particular aspect of the 
findings suggests that the differential effects 
of anxiety upon performance—as mediated by 
defensive reactions—may vary systematically 
depending upon both intelligence level and 
type of task and instructions. 

The statistically significant Trials X Order 
interaction found in the analysis of variance 
of the log-latency scores is a reflection of the 
fact that both the HA and LA performed much 
better on whatever came last in sequence, 
regardless of order of task difficulty. This result 
is interpreted to mean that there was a large 
learning effect operating within trials, with 
both the HA and LA learning with about equal 
facility. 

The fact that no Anxiety X Difficulty inter- 
action was found in the present study is rele- 
vant in at least two respects. First, it is inter- 
preted as supporting the hypothesis that with 
respect to the effect of anxiety upon perform- 
ance, qualitative differences in type of task, 
task instructions, and stable defensive reac- 
tions to the anxiety are more important than 
quantitative differences in the level of diffi- 
culty of the tasks. Second, the absence of such 
an interaction—despite the fact that the 
analysis also showed clearly that the variation 
in task difficulty was extremely large—may 
be of theoretical significance in view of the 
fact that those who conceive of anxiety as a 
simple “motivational drive” rely upon the 
presence of an interaction between anxiety and 
task difficulty for support of their theory 
(Castaneda, Palermo, & McCandless, 1956). 

Finally, in view of the success of the Yale 
Test Anxiety Scale for Children in differen- 
tiating between efficient and inefficient per- 
formers at several levels of intelligence and 
between high and low cautious Ss, the results 
contribute to the validity of the scale for pre- 
dicting the problem solving behavior of groups. 
However, it remains clear that the size of the 
group differences is not so great that one may 
confidently predict the behavior of individual 
Ss. 


SUMMARY 


Two anxiety scales and an intelligence test 
were administered to 280 sixth-grade boys. 


Forty-eight Ss, divided into 12 groups in a 2 
(high and low anxious) X 3 (high, medium, 
and low IQ) X 2 (order of administration of 
easy and difficult items) design, were indi- 
vidually administered an embedded figures 
task. The task items ranged from extremely 
easy to extremely difficult. Three scores were 
derived from the criterion task for each S: a 
latency score, number of figures solved, and a 
cautiousness index. Three predictions were 
made: (a) Highly cautious Ss do better than 
low cautious Ss on the criterion task; (6) HA 
Ss obtain higher cautiousness scores than LA 
Ss; (c) The performance of HA Ss on the cri- 
terion task is superior to that of the LA Ss. 
The first two predictions were confirmed. The 
third prediction was confirmed for Ss at the 
low and medium IQ levels. The analysis of 
variance of the latency scores yielded a sig- 
nificant Anxiety X linear IQ interaction 
which resulted from the fact that the perform- 
ance of the HA was superior to that of the LA 
at the low and middle levels of IQ and in- 
ferior at the high IQ level. The results of the 
analysis of the cautiousness scores were inter- 
preted as supporting the thesis that the effect 
of anxiety on performance, whether facilitating 
or interfering, is mediated primarily by de- 
fensive reactions to the anxiety. It was also 
concluded that the differential effects of 
anxiety upon performance may vary system- 
atically depending upon both intelligence level 
and type of task and instructions. 
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THE EFFECT OF “SOCIAL CENSURE” AND “SOCIAL APPROVAL” ON 
THE PSYCHOMOTOR PERFORMANCE OF SCHIZOPHRENICS! 


DAVID K. CAVANAUGH?®, WALTER COHEN, anp PETER J. LANG 


University of Buffalo 


N RECENT years many investigators have 
been concerned with the effects of special 
motivational conditions on the psycho- 

logical deficit displayed by schizophrenics. 
Some workers have introduced into the experi- 
mental situation stimuli which may be de- 
scribed as rewarding; others have utilized 
socially or physically aversive stimuli. All of 
these conditions have resulted in changes in 
performance, and special claims have been 
made for the effectiveness of one or the other 
of the motivational conditions. Thus, Pascal 
and Swenson (1952), Cohen (1956), Cavanaugh 
(1958), and others (Buss, Weiner, & Buss, 
1954; Lang, 1959) report a decrease in psycho- 
logical deficit under conditions of punishment. 
Conversely, Stotsky (1957), Peters and 
Jenkins (1954), and others report that rewards 
are effective in reducing the schizophrenic’s 
impairment. Rodnick and Garmezy (1957) 
have conducted an extensive research program 
designed to examine the effect of social punish- 
ment and reward on the performance of schizo- 
phrenic Ss. These authors conclude from their 
research that even “‘a minimal punishment” 
has a deleterious effect on the performance of 
schizophrenics on a wide range of tasks. An 
alternative interpretation of their data is that 
it indicates simply that punishment is an 
effective modifier of schizophrenic behavior. 
The direction of the modification is not always 
in a negative direction. When a punishment is 
of a general noninformational nature, deterio- 
ration is most readily observed. When punish- 
ment, however, is associated with specific 
kinds of responses, a reduction in the frequency 
of those responses is observed. In Rodnick 
and Garmezy’s research, it is possible to re- 
interpret some of their results as not neces- 
sarily indicating decrement. The arbitrary 
evaluation of the reduction in responses as 

! This study is part of a research program supported 
by United States Public Health Service Grant M-2173, 
“Motivation and improved performance in schizo- 
phrenia.”’ 

* Chief Psychologist, Buffalo State Hospital 

3 Now at the University of Pittsburgh 
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improvement or decrement is a function of the 
specific task and measures utilized by the ex- 
perimenter. The studies that suggest that 
punishment may result in improvement in 
both psychomotor and concept formation 
tasks use an escape paradigm, whereas most 
of the studies cited by Rodnick and Garmezy 
(1957)—where punishment resulted in decre- 
ment—use an avoidance design. 

In recent research by Lang (1959) and Buss 
(1954), information was conveyed to schizo- 
phrenics about the inadequacy of their re- 
sponses. In this avoidance situation, the 
schizophrenics’ performance improved. How- 
ever, in this research the stimulus used to 
convey information was neutral rather than 
noxious. The question that arose from con- 
sideration of the several studies was whether 
a noxious stimulus which was presented to S 
(with useful information) would lead to im- 
provement, as in Lang’s experiment (1959), or 
to decrement, as with Rodnick and Garmezy 
(1957). 


EXPERIMENT I 


In order to examine these propositions in 
the present study, information about an inade- 
quate response was introduced into a reaction 
time situation in two ways. One group of 
schizophrenic Ss was presented with a neutral 
tone whenever the S’s response was “too 
slow.” A second group was verbally censured 
when they responded slowly. On the basis of 
previous experimental findings and a critical 
evaluation of the Rodnick and Garmezy ma- 
terial, the following hypotheses were de- 
veloped: (a) Ss who receive information about 
their responses under neutral conditions show 
greater improvement in reaction time than Ss 
who do not receive such information; (6) Ss 
who receive information about their responses 
under conditions of censure show greater im- 
provement in reaction time than Ss who do 
not receive information; (c) Ss who receive 
information about their performance show 
greater improvement than those who do not. 
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Procedure 


Ss for this experiment were 36 male, chronic schizo- 
phrenic patients, selected from the in-patient popu- 
lation of the Veterans Administration Hospital, 
Canandaigua, New York.‘ They ranged in age from 26 
to 40 years (mean = 33.64), and all had been hospi- 
talized more than one year. All Ss had attended high 
school for at least one year (mean years education = 
12.11). No S was retained in the sample who had a 
history of primary mental deficiency or organic cerebral 
disease. 

A disjunctive reaction time apparatus described 
elsewhere (Lang, 1959), provided the experimental 
task. Ss were instructed to respond to the illumination 
of one of two neon lights by executing a 10.5 in. “jump” 
with the hand to one of two standard telegraph keys. 
The “jump” interval was recorded to the nearest .01 
sec. by a Standard Electric Timer 

In addition to the lights, sound stimuli were intro- 
duced under two conditions of the present experiment. 
These stimuli were prerecorded on separate tape “loops” 
and delivered to S through earphones from a tape 
recorder which formed part of the apparatus. The time 
of presentation and duration of the sound stimuli were 
controlled by two Hunter Decade Interval Timers. 

The initial instructions and procedure were es- 
sentially the same as those previously described by 
Lang (1959). Ss received a total of 63 trials at the 
apparatus. These were divided into three series of 21 
trials each—preinformation, information, and post- 
information. Each series was administered in three 
blocks of seven trials with a .45 sec. rest between each 
block. 

The preinformation trials were conducted under the 
usual conditions of administration for a reaction time 
RT) task 

On the basis of their performance during this series, 
Ss were divided into three matched groups: Control 
(CO); Tone (TN); and Censure (CN). This initial 
matching was based on each S’s fastest median score 
for any block of seven reaction times. A matched pairs 
analysis of variance based on median scores for the 21 
preinformation trials revealed that differences be 
tween the three groups did not approach significance 
(F = .06, df = 2 and 11).5 

Depending on group assignment, Ss _ performed 
during information trials under one of the following 
conditions: 

1. TN. A 300 cps tone was presented when the S 
pressed the correct key, if the S responded after a 
previously established time interval. Depression of 
the key before the termination of the interval would 
not activate the tone. The time interval, from onset 


‘The authors are grateful to the professional staff 
of the VA Hospital, Canandaigua, N. Y. for their 
assistance in supplying the facilities necessary for the 
completion of the study 

§ Ss who did not achieve a median reaction time for 
any block of seven preinformation trials of less than 
75 sec. were not retained in the sample. Previous 
research (Lang, 1959) raised doubts that Ss with 
slower scores were responding to the experimental 
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of the light to the time when pressure on the key 

would activate the tone, was determined for each 

block of seven trials by the S’s median RT for the 
previous block of seven trials. 

2. CN. The verbal message, “That was bad—too 
slow,”’ delivered authoritatively by a male voice, 
was presented in the same manner as that described 
above for the tone. The duration of both the tone 
and the message was 5.5 sec. Both were presented at 
low but clearly audible intensity. 

3. CO. No additional stimuli were introduced into 
the experimental situation. 

Prior to the information trials, Ss were instructed 
appropriately, depending on condition assignment. 
Members of the TN group were told that they would 
hear a tone through the earphones if they were “too 
slow” in responding to the lights. The CN Ss were told 
that a voice would inform them through the earphones 
if they were “too slow.” To assure the fact that Ss did 
not forget the meaning of the voice or tone, they were 
reinstructed between each block of seven trials that 
the tone or voice indicated that their previous response 
had been too slow. These instructions were framed in a 
nonpunitive fashion. Ss were told that the voice or 
tone was designed to help them to respond more 
quickly by providing information about their per- 
formance. 


Results 

Performance during postinformation and 
information trials was analyzed in terms of 
decrement scores. These were obtained by 
separately subtracting each S’s median score 
for the information and postinformation trials 
from his median preinformation trials score. 
The Wilcoxon test for paired replicates 
(Seigel, 1956) was used to evaluate differences 
between the three experimental groups. 

The results, reported in Table 1, confirmed 
the previous finding (Lang, 1959): schizo- 
phrenic Ss who receive information about their 
performance under neutral conditions (TN) 
show significantly (.05 level) greater improve- 


TABLE 1 
Witcoxon Marcuep-Parrs SIGNED-RANKS TEST 
ANALYSIS OF DIFFERENCES BETWEEN CONTROL 
AND INFORMATION CONDITIONS 


Trials Series 
Groups tested 


Information Postinformation 


CO—TN 11.0° 16.0 
CO—CN 6.0* 12.0° 
TN—CN 21.0 29.5 
Note.—Entries in this table are the lowest sums by sign 
+ or —) of ranked differences between paired replicates. For an 
NV of 12 (as in this case) a sum <14 = p < .05 
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Fic. 1. Mean reaction time for the control and 
information conditions 


ment than Ss who do not receive such informa- 
tion (CO). Furthermore, the CN group also 
improved significantly (.01 level) more than 
did the control Ss, and this group maintained 
the improvement during the postinformation 
trials. No significant differences were found 
between the CN and TN groups for either in- 
formation or postinformation trials. The mean 
RTs for the experimental groups at each stage 
of performance are presented graphically in 
Fig. 1. 
EXPERIMENT II 

Experiment ITI was designed to examine the 
differential effects of social rewards and cen- 
sure on the disjunctive reaction time (RT) 
performance of chronic schizophrenic Ss. Like 
the Rodnick and Garmezy studies (1957), 
both the reward and censure conditions were 
such that Ss received information about their 
responses. Three groups of Ss were employed. 
One group was informed in a punitive manner 
that their response had been “too slow” when- 
ever their performance fell below a prede- 


termined score. A second group of Ss was 
informed that they were performing well when- 
ever their response latency was faster than 
a similarly determined standard. A third group 
received no specific information regarding the 
adequacy of their performances. 

Experiment I indicated that schizophrenic 
Ss who receive information about their per- 
formance under censure conditions show 
greater improvement in RT performance than 
Ss who do not receive information. The con- 
tribution of information in a setting of social 
approval or reward was considered in Experi- 
ment II. It was hypothesized that censure 
would not be less effective than approval in 
this setting. 


Procedure 


Ss for Experiment II were 27 male, chronic schizo- 
phrenic patients selected from the in-patient population 
of the Veterans Administration Hospital, Canandaigua, 
New York. They ranged in age from 25 to 39 years 
(mean = 32.89), and all had been hospitalized more 
than one year. All Ss had attended high school at least 
one year (mean years education = 11.65). No S was 
retained in the sample who had a history of primary 
mental deficiency or organic cerebral disease. 

The apparatus was the same used as that used in 
Experiment I with the addition of a new tape “loop” 
for the reward group. A similar procedure was also 
used. As in Experiment I, on the basis of pre-infoma- 
tion trials performance, Ss were divided into three 
matched groups,’ (F = .15, not significant for matched 
pairs analysis of variance of preinformation group 
means) 

During the information trials, Ss in groups CN and 
CO (censure and control) performed the reaction time 
under the same conditions as in Experiment I. In the 
present experiment the third group, Reward (RE), 
performed the information trials reaction time under 
the following conditions: When the S’s response was 
“fast enough,” the following message was delivered to 
the S through the earphones: “That was good—very 
fast.” The voice was the same as that used for 
the punishment condition. The interval during which 
the striking of the correct response key would lead to 
the presentation of the message was determined by the 
S’s median reaction time score for the previous seven 
trials. Whenever S’s reaction time was faster than the 
median, the message was automatically delivered. 
Whenever S was slower than this score no additional 
information was presented. 

The information variable was successfully con- 
trolled so that the (CN) and (RE) groups received 
approximately the same number of messages.® Prior 
to the information trials, Ss were told about the message 
they would hear through the earphones, depending on 


* The mean number of messages received by Ss in 
the experimental groups were 9.8 for the CN group and 
10.2 for the RE group. 
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the particular experimental condition. The type of 
instruction was similar to that used in Experiment I 
js were reinstructed between each block of seven trials 
to assure that they would not forget the meaning of the 
verbal message. They were told that the voice was 
intended to help them respond more quickly 


Results 


The difference between a S’s median reac- 
tion time for the preinformation trials and his 
score for information trials provided a measure 
of each S’s improvement. Groups were ana- 
lysed in terms of these scores, using the 
Wilcoxon test for paired replicates (Seigel, 
1956). The results are reported in Table 2. 
Surprisingly, there was no evidence of differ- 


TABLE 2 
Witcoxon MatcHep-Parrs SIGNED-RANKS TEST 
ANALYSIS OF DIFFERENCES BETWEEN CONTROL 
AND INFORMATION CONDITIONS 
Trials Series 


Groups tested 


Information Postinformation 








RE—CN 3.0* 5.5} 
RE—CO 15.0 14.5 
CN—CO Lo 3.0° 
Note.—Entries in this table are the lowest sums by sign (+ 
or of ranked differences between paired replicates. For an 
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Fic. 2. Mean reaction time for the control, approval 
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ence between the improvement scores of con- 
trol and reward Ss for the information series. 
As in Experiment I, Ss performing under the 
CN condition displayed significantly greater 
improvement than the control group (and also 
the RE group). 

The performance of Ss under the postinfor- 
mation condition were similarly analysed, 
using as the S’s score the difference between 
median preinformation performance and that 
achieved during postinformation trials. The 
CN group again proved to be significantly dif- 
ferent from both the Reward and Control 
groups. The difference in improvement be- 
tween the CO and RE groups was not signifi- 
cant. The mean RTs for the experimental 
groups at each stage of performance are pre- 
sented graphically in Fig. 2. 


DISCUSSION 


The results of these experiments suggest 
that social censure can be an effective positive 
modifier of schizophrenic behavior. This find- 
ing is consistent with Buss et al. (1954) and 
others (Cavanaugh, 1958; Lang, 1959; Pascal, 
1952), but not in agreement with Rodnick and 
Garmezy’s (1957) interpretations. It was sug- 
gested in the introduction that even the results 
obtained by Rodnick and Garmezy could 
logically be interpreted in some instances as 
not indicating decrement. A good example of 
this is found in Garmezy’s basic study of 
stimulus generalization, where he reports that 
schizophrenics were no different from normal 
Ss until social censure was introduced. With 
censure box) flashing “‘wrong’’) the normals 
presented a steepening response gradient, 
while no essential change was noted in the re- 
sponse gradient of the schizophrenic Ss. How- 
ever, Garmezy’s procedure involved a differen- 
tial reinforcement of “push” and “pull” 
responses. That is, the only response for which 
any S was censured occurred when the S indi- 
cated that the most dissimilar tone was the 
same as the training stimulus. Thus, only 
“pull” responses were punished. These results 
indicate that the schizophrenics showed a re- 
duction of pull responses. It is possible to 
interpret these results as indicating that S 
learned to avoid punishment. However, 
Rodnick and Garmezy prefer to interpret it in 
terms of a reduction of discrimination, i.e., a 
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flattening of the generalization curve. Similar 
methodological questions may be raised with 
regard to a number of other studies cited by 
Rodnick and Garmezy (1957). 

The present results suggest that it is not 
necessarily censure, as such, which is crucial to 
the schizophrenic’s change in behavior (i.e., 
the censure and neutral tone groups were not 
significantly different in degree of improve- 
ment). It would appear that the more im- 
portant factor was the information about in- 
adequate responses itself, rather than the way 
it was conveyed. 

Bugelski (1956) has stressed the importance 
of “‘knowledge of results” in any task in which 
improvement is to be expected with practice, 
particularly as this knowledge modifies the 
S’s “self instruction” or “set’’ to respond. For 
most tasks, the normal S will readily “‘recog- 
nize the correctness or wrongness of his re- 
sponses as soon as they occur, and no outside 
agency is required” (Bugelski, 1956). The 
spontaneous verbalizations of normal Ss in a 
previous study (Lang, 1959) suggest that this 
is valid for the experimental task used in this 
research. However, the work of a number of 
writers implies that it is not true for schizo- 
phrenic Ss. Schizophrenics appear to be more 
distractable (Cameron, 1947; Chapman, 1956), 
and less able than normals to maintain a set 
to respond to a class of stimuli (Shakow, 1950). 
Thus, additional informational cues introduced 
by the EZ may be of much greater importance 
to psychotic Ss. From this point of view, 
censure, as such, may be less crucial to schizo- 
phrenic deficit than aspects of the experi- 
mental situation which tend to modify set or 
distract or focus the S on the task. 

The results of Experiment II suggest that 
reward (in an informational sense) may have 
little tangible effect upon the reaction time 
performance of chronic schizophrenic subjects. 
Nevertheless, the inference that reward is never 
effective with schizophrenics is not a necessary 
conclusion from the present findings. Stotsky 
(1957) reported some improvement of RT 
performance under “reward” (supportive 
urging) conditions with schizophrenics. It is 
assumed that the present use of reward helped 
gain the cooperation and interest of the Ss. In 
addition, however, it was aimed more precisely 
at modifying specific responses as they oc- 
curred. 


It is possible that with a reward condition 
such as that used in the present design, the 
function of verbal approval is to confirm an 
on-going performance without necessarily 
leading to any change in set. In the censure 
condition, however, S is given a clear indica- 
tion that he is performing inadequately, and 
in a sense, instructions to do better are im- 
plicit. The effects of these differential qualities 
of approval and censure, specifically as they 
modify the S’s set to respond, may account for 
the present findings. This interpretation is 
consistent with the gradual improvement of 
the RE group noted in Fig. 1, as opposed to 
the rapid acceleration of the CN group. Dif- 
ferential implications of reward and punish- 
ment for the Ss’ set to respond may also ac- 
count for the hierarchy of effective reinforcers 
found by Buss et al. (1954) and for the find- 
ings of Rodnick and Garmezy (1957). 

In a recent study of reaction time perform- 
ance with brain damaged patients, Blackburn 
(1958) reports that Ss who were informed 
about their performance and told they could 
do better improved significantly more than Ss 
who were told that they had done well. Al- 
though other factors were involved, this find- 
ing tends to support the interpretation sug- 
gested by the present investigation. Black- 
burn’s results point up the desirability of 
studies comparing schizophrenic and brain- 
damaged patients on the present task, as well 
as tasks used by others in the investigation of 
deficit and schizophrenia. 

In general, the present findings indicate the 
need for examining rather extensively the spe- 
cific qualities and implications of the stimuli 
one labels as rewarding or punishing. Such ex- 
amination may well benefit from reference to a 
theoretical framework which includes con- 
sideration of the role of reinforcers in modify- 
ing and strengthening S’s set to respond. 


SUMMARY 


The present study was designed to examine 
the effects of information about inadequate 
responses on the disjunctive reaction time of 
chronic schizophrenic Ss. Four experimental 
conditions were introduced: control, neutral 
information, censure, and approval. Under the 
neutral information condition, a “‘slow” re- 
sponse (based on a previous estimate of S’s 
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performance) resulted in the brief presenta- 
tion of a 300 cps tone, delivered to S through 
earphones. Under censure conditions, a slow 
reaction time (relative to previous perform- 
ance) resulted in the presentation of a verbal 
message, disapproving of the S’s performance. 
A fast reaction time was followed by verbal 
approval. 

Analysis of the data revealed that both the 
neutral information and censure conditions 
led to an improvement in the reaction time 
significantly greater than that observed among 
Ss who did not receive additional information. 
No significant difference was obtained between 
the censure and neutral information groups. 
The approval or “‘reward” condition had little 
effect upon Ss’ performances as compared 
with controls. 

The findings were compared with other 
studies where conflicting evidences regarding 
the role of reward and punishment were dis- 
cussed. The findings were interpreted as evi- 
dence against the notion that social censure is 
necessarily productive of deficit. An interpre- 
tation of the relative effectiveness of approval, 
censure, and information was offered based 
upon the concept of set. 
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COGNITIVE AND EMPATHIC FACTORS IN ROLE-TAKING 
BY SCHIZOPHRENIC AND BRAIN DAMAGED PATIENTS' 
NORMAN A. MILGRAM? 


Boston University 


HIS study investigates deficiencies in 
role-taking on the part of schizophrenic 
patients and  nonpsychotic brain 
damaged patients. Cronbach (1958) and others 
(Gage, 1955; Bronfenbrenner, 1958) have cited 
the theoretical and methodological problems 
encountered in research on role-taking and 
interpersonal perception. The definitions and 
operations employed in this study were selected 
in an effort to avoid such difficulties. Role- 
taking is defined as an implicit, empathic 
process whereby a person predicts the behavior 
in a given situation of another person or 
persons. According to Taft (1955) role-taking 
is said to involve two different types of 
empathy, individual empathy and mass 
empathy. The former is an intuitive process 
by which one experiences the thoughts and 
feelings of another person. The latter is an 
inferential process whereby one uses socially 
shared, conventional frames of reference about 
groups and group members as a basis for 
predicting their behavior. The present study is 
concerned with the analytic judgments re- 
quired in mass empathy, and investigates the 
ability of subjects to identify the verbal re- 
sponses of men, women, adults, and children in 
multiple-choice word association tests. 
Previous studies in this area (Cline, 1955; 
Dymond, 1950; Sarbin, 1954; Wittick, 1955) 
have shown a positive relationship between 
empathic ability and psychological adjust- 


! This paper is based on a dissertation submitted 
in partial fulfillment of the requirement for the degree 
of doctor of philosophy at Boston University, 1958. 
Sincere thanks are extended to Austin Berkeley, Henry 
Weinberg, and Harold Goodglass, who served as 
members of the author’s thesis committee. The writer 
is indebted to C. Bennett and B. Stotsky who con- 
tributed to this study by their critical readings and 
comments. The assistance of the staffs and patients 
of the Boston and Bedford Veterans Administration 
Hospitals is gratefully acknowledged. An abbreviated 
form of this study was presented at the meetings of 
the Midwestern Psychological Association in May, 
1959 

*Now at the Nebraska Psychiatric Institute, 
Omaha, Nebraska 
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ment. A number of theorists (Cameron, 1943; 
Dunham, 1944; Faris, 1944; Hoskins, 1946; 
Sullivan, 1944) have suggested that schizo- 
phrenia in particular is characterized by a 
breakdown in role-taking skills. Several studies 
(Flavell, 1955; Helfand, 1956; McGaughran 
& Moran, 1956; Sarbin & Hardyck, 1955) 
have borne out this assertion. 

These investigations have not considered, 
however, the possibility that both cognitive 
and empathic abilities are required for an ef- 
fective role-taking performance. In the typical 
role-taking test, the subject is called upon to 
give his own idiosyncratic responses on one 
form of a preference test and to simulate the 
answers of another person or persons on the 
same or a second form of the test. It would 
appear that the cognitive ability to shift from 
giving one’s preferred or dominant responses 
to deliberately giving the nondominant re- 
sponses that characterize another is a pre- 
requisite to simulating the responses of 
another. A second cognitive ability is the 
forming of concepts as measured by the ability 
to select the terms that are normally connoted 
by a conceptual label, and the rejection of 
terms that are not. In the present study where 
Ss are instructed to choose the verbal responses 
that men, women, adults, and children char- 
acteristically give to certain stimulus words, 
these Ss must have, as a cognitive minimum, 
the ability to attach relevant connotations to 
these conceptual headings, men, women, etc. 

It has been generally agreed that brain- 
damaged patients suffer some degree of cogni- 
tive deficit (Hunt & Cofer, 1944). With respect 
to cognitive deficit in schizophrenia, some 
theorists (Goldstein, 1943) maintain that 
schizophrenic patients demonstrate cognitive 
deficits not unlike those observed in brain- 
damaged patients. Others (Cameron, 1939; 
Sullivan, 1944) argue that what is lessened 
or lost in schizophrenia is not cognitive abili- 
ties, but the intent or the ability to take the 
role of the other as a guide to effective com- 
munication with others: 
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Hypothesis I. Schizophrenic patients and 
brain damaged patients are deficient in role- 
taking in comparison with normals. 

Hypothesis II. Brain damaged patients are 
deficient in cognitive abilities in comparison 
with schizophrenic patients. 

Hypothesis III. Schizophrenic patients are 
deficient in role-taking in comparison with 
brain damaged patients when differences in cog- 
nitive abilities are taken into account. 


METHOD 


Subjects 

Three male groups consisting of 32 schizophrenic 
patients, 30 brain damaged patients, and 20 normal 
Ss were used 

Criteria for selection of the schizophrenic group 
included the following: (a) all patients had been given 
a psychiatric diagnosis of schizophrenia, the diagnostic 
breakdown showing paranoid (24), hebephrenic (2), 
catatonic (2), and chronic undifferentiated (4); (5) 
no patient had neurological involvement, had under- 
gone somatic therapy (electric shock or insulin coma) 
within the past six months, or had ever undergone a 
lobotomy 

Criteria for selection of the brain damaged group 
included these observations: (a) all patients had been 
given a neurological diagnosis of damage to the cerebral 
cortex, these diagnoses including vascular disease 
(9), degenerative disease (12), chronic brain syndrome 
(9); (6) no patient had a previous history of severe 
mental disturbance or was known to be psychotic at 
time of testing. 

The three groups were selected with attention te 
age, IQ (as measured by the Vocabulary Subtest of 
the Wechsler-Bellevue Form I), and education. The 
data are summarized in Table 1. Differences in age 
and IQ were not significant. A statistically significant 
but small difference in years of formal education be- 
tween normals and the two patient groups was ob- 
served. 


Role-taking Tests 


Multiple-choice word association tests were chosen 
as measures of role-taking performance because the 
various responses to each stimulus word could be pre- 
coded as to the age or sex group into which they fell. 
More important, normative data were available as 


TABLE 1 
AcE, IQ, anpD YEARS OF ForRMAL EpvUcATION 
Group Age IQ Education 
N M 44.6 116.2 12.0 
SD 15.2 10.0 2.7 
> M 43.1 114.9 10.8 
SD 13.4 15.7 3.2 
BD M 45.6 110.5 11.5 
SD 15.2 17.7 3.2 


to the empirically observed word association preferences 
of these groups. 

Terman-Miles Masculinity-Femininity Word Asso- 
ciation Test (1936). Ss were instructed to choose the 
verbal associations characteristically given by men 
on one form of the test and by women on the other. 
Each form contained 60 items. In any given item, there 
was a stimulus word printed in capital letters and four 
response words in lower case. One or more of these 
responses had been empirically demonstrated to be 
chosen more frequently by men, one or more by women, 
and the remaining responses were nondiscriminating. 
Masculine responses were scored +, feminine —, and 
neutral zero. 

Abstract-Concrete Word Association Test.2 The 
paradigm for this test was taken from Flavell (1956) 
and was based on empirical evidence in the literature 
(Woodrow & Lowell, 1916) and in pilot studies support- 
ing the assertion that adults choose abstractly related 
associations (synonym, superordinate, and subordinate) 
in preference to concretely related associations (part- 
whole, adjective, verb, concrete context) and children 
do the reverse. Each form of the test contained 48 
items. All stimulus words were nouns. Response words 
were nouns, adjectives, and verbs. In any test item, 
the stimulus word bore an abstract relationship to one 
of the response words and a concrete relationship to 
the other, e.g., DOG—animal (superordinate), bark 
(verb). The score was the number of abstractly related 
associations chosen under each of two sets of instruc- 
tions, i.e., to take adult and child roles. 


Cognitive Tests 


Mental Set Shifting Test. This test, taken from 
Appelbaum (1958), involved three presentations of 
50 words from the Kent-Rosanoff list (1938). On the 
first presentation, free associations were obtained; 
on the second, Ss were asked to respond with the same 
word they had given previously; on the third, Ss 
were instructed to give responses different from the 
previous ones. The score was the number of different 
responses on the third presentation. The rationale for 
this measure is that S is called upon to inhibit his own 
dominant responses and to adopt and sustain the 
mental set to give nondominant responses. An analogous 
cognitive process is required in role-taking. 

Concept Forming Test. This test, adapted from Capps 
(1939), involved a discrimination between relevant 
and irrelevant associations to a stimulus word. Ss 
were instructed to select 3 of 7 response words which 
went with the stimulus word in some way. The score 
was the number of items (25 in all) in which all 3 
related words were selected. This measure is analogous 
to a second cognitive requirement of role-taking, 
i.e., that of selecting the terms that are normally 


3 A copy of the Mental Set Shifting, Concept Form- 
ing, and Abstract-Concrete Word Association Tests 
has been deposited with the American Documentation 
Institute. Order Document No. 6133, from ADI 
Auxiliary Publication Project, Photoduplication Ser- 
vice, Library of Congress, Washington 25, D. C., 
remitting in advance $1.25 for microfilm or $1.25 for 
photocopies. 
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connoted by a conceptual label and rejecting the 
terms that are not. 


Procedure 

Ss were tested individually, usually in one session. 
The Mental Set Shifting Test was administered first 
in order to obtain each S’s normally dominant responses. 
Two alternate forms of the Terman-Miles and of the 
Abstract-Concrete Word Association Tests were ad- 
ministered next with instructions to choose the asso- 
ciations of men and women on the first pair and of 
adults and children on the second. The Concept 
Forming Test and Vocabulary Subtest followed. 


RESULTS 


Hypothesis I. Means and standard devia- 
tions of the role-taking scores for the three 
groups are presented in Table 2. It is evident 
that the difference between instructions was 
much greater for the normals than for either 
of the two patient groups. Since the over-all 


TABLE 2 
MEANS AND STANDARD DEVIATIONS OF 
ROLE-TAKING SCORES 


Measure Group M SD D »* 
Male N 6.60 6.00 
BD — .50 6.79 | 7.10 |}<.001 
S —4.28 8.14 | 3.78 |}<.05 
Female N —15.05 7.53 
S —11.35 8.87 | 3.70 13 
BD> | —8.93 9.28 | 2.42 28 
Adult N 35.85 6.27 
S 32.91 6.63 | 2.94 11 
BD 28.90 5.56 | 4.01 |<.02 
Child N 20.05 8.00 
S 23.41 7.72 | 3.36 10 
BD» 26.23 6.83 | 2.82 2 


* All statistical tests are two-tailed 
> Normals were superior to the Brain damaged group on this 


ymparison beyond the .02 level 


TABLE 3 
ANALYSIS OF VARIANCE OF ROLE-TAKING 
SCORES 
Male-Female Adult-Child 
Source - ~ > 
a4 Mean P Mean 
2 Square ? Square p 
Groups 2 | 224.43 — 5.48 
Subjects 79 81.38 60.84 
Instructions 1 (5071.61 |<.001 |2987.80 |<.001 
Instructions 2 | 740.24 |<.001 |1029.63 |<.001 
xX Group 
Subjects X 79 45.19 27.19 
Instructions 
Total 163 


TABLE 4 
MEANS AND STANDARD DEVIATIONS OF 
COGNITIVE TESTS 


Measure | Group Mv SD D | ?* 
= om — — - —— —_——- - 
Mental Set | N 45.30 | 4.41 | | 
Shifting is 42.59 | 8.80 | 2.71 | .15 


Concept Form- | N | 23.70 | 1.14 | 
ing | S_ | 22.03 | 3.62 | 1.67 |<.03 
| BD | 21.73 | 2.70| 0.30| — 


BD | 32.46 |12.08 |10.13 i 





* The / test with Snedecor’s Correction for heterogeneity of 
variance was used (McNemar, 1949). With respect to Concept 
Forming scores, value of H from Kriskal-Wallis analysis of vari- 
ance scarcely approached significance and the differences between 
normals and patient groups are of questionable significance. 


TABLE 5 
ANALYSIS OF COVARIANCE: MENTAL SET 
SHIFTING* ON ROLE-TAKING SCORES 


Before Adjustment After Adjustment 


Measure | Source 


a | Sctre| # | 4 | Sacre 

Male Between| 1 | 221.58| .05| 1 | 718.41 iis 
Within 60 56.53 so | 43.73 | 

Total 61 | & 

Female Between 1 89.96 25 1 2.00; .89 
Within 60 82.22 59 80.16 | 
Total 61 60 

Adult Between 1 248.52 |<.02 1 227.03 |<.02 
Within 00 37.46 59 37.89 | 
Total 61 60 

Child Between 1 | 123.75 11 1 11.31 | .88 
Within @ | 45.55 59 | 43.44 
Total 61 60 | 
' 


* An Arcsin transformation (Walker & Lev, 1953) was used to 
make the variances homogeneous as a necessary condition for 


covariance adjustment. 


interaction between Instruction and Groups 
was significant and in the predicted direction 
(Table 3), Hypothesis I was confirmed. Com- 
parison of the three groups on the separate 
scores shows that normals were superior to 
both patient groups on male scores, but only 
to the brain damaged group on female scores. 
Normals were superior to the brain damaged 
group on both adult and child scores, but were 
not significantly different from the schizo- 
phrenic group on either. 

Hypothesis II. Means and standard devia- 
tions of scores on the cognitive tests are pre- 
sented in Table 4 and show that the hypothesis 
is confirmed with respect to mental set shifting. 








te 
nm 
nN 


The difference in concept forming between the 
two patient groups was not significant. 

Hypothesis IIT. Hypothesis III held that 
the mean role-taking score of the brain 
damaged group is greater than that of the 
schizophrenic group when a correction is made 
for differences in cognitive abilities (i.e., 
mental set shifting). This assertion was con- 
firmed for male scores only. The data on analy- 
sis of covariance of mental set shifting scores 
and the four role-taking scores are summarized 
in Table 5. Analyses of covariance for female 
and child scores revealed no significant differ- 
ences between the patient groups, although 
the adjustment was ia the predicted direction. 
With respect to adult scores, the schizophrenic 
group was superior both before and after 
covariance adjustment. 


DISCUSSION 


Hypothesis I. Experimental findings have 
confirmed the hypothesis that schizophrenic 
and brain damaged patients are deficient in 
role-taking performance. These findings are 
consistent with interpersonal theory (i.e., 
Cameron, 1939), which states that schizo- 
phrenics are deficient in taking the role of the 
other, and with cognitive theory (i.e., Gold- 
stein, 1943), which states that brain damaged 
patients do poorly in role-taking tasks because 
of their deficits in the cognitive abilities re- 
quired for effective role-taking. 

The specific deficiency of the schizophrenic 
group in taking the male role is also consistent 
with the psychoanalytic theory of a relation- 
ship between paranoid schizophrenia (24 of 
the 32 schizophrenic patients were paranoid) 
and feminine identification (Freud, 1924). It 
is worthwhile noting that the use of the 
Terman-Miles Word Association Test to dis- 
criminate between normals and paranoid 
schizophrenics has singularly failed until now. 
Musiker (1952) cites a number of studies, 
including his own, to make this point. In these 
earlier studies, the test was administered with 
instructions “‘to choose the word that goes 
best or most naturally with the stimulus 
word.” Male Ss tended to choose both male 
and female responses with the latter in the 
majority by a narrow margin. In the present 
study where Ss were specifically instructed 
to discriminate male and female responses, 
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normals were successful in this deliberate 
effort, but schizophrenics were not. 

Hypothesis IT. Results on the Mental Set 
Shifting Test suggest that the inability to 
inhibit one’s dominant or preferred mode of 
response is a specific characteristic of thought 
pathology in organic brain damage. Schizo- 
phrenics as a group were not significantly 
different from normals. The few schizophrenics 
who did poorly on this test tended to be older, 
chronically ill patients. Their “brain-damaged- 
like” performance may have been related 
either to chronicity of illness, to the mental 
deterioration that presumably accompanies 
senescence (Wechsler, 1944), or both. Rank 
order correlation coefficients between age and 
mental set shifting were .46 for the brain 
damaged group and .61 for the schizophrenic 
group. The higher correlation for the schizo- 
phrenic group may reflect a compounding of 
chronicity of illness and age. The older schizo- 
phrenic patients had been hospitalized for 
many years (up to 40), whereas the brain 
damaged patients, regardless of age, had all 
had fairly recent onsets of illness and 
hospitalization. 

It is felt that the Concept Forming Test 
failed to support the hypothesis because of 
its low ceiling of difficulty. Even the difference 
between the normals and the two patient 
groups was of questionable significance. 

Hypothesis IIT. The magnitude of the cor- 
relations which obtained between mental set 
shifting and male-female role-taking scores 
confirms the original contention, that the role- 
taking task has cognitive as well as empathic 
requirements. The covariance adjustment of 
the male scores highlights the specifically 
empathic deficiency of the schizophrenics in 
taking the male role. 

Several explanations come to mind to ac- 
count for the failure of the brain damaged 
group on the adult-child role-taking polarity. 
This task implicitly requires a discrimination 
between abstract and concrete word relation- 
ships. Considerable evidence has shown that 
brain damaged patients are especially deficient 
in this discrimination (Milgram, 1959); hence, 
they are penalized by an additional cognitive 
requirement which does not apply in the male- 
female role-taking task. Moreover, in the male- 
female polarity, an S is aided in his choices by 
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specific knowledge about the habits, interests, 
feelings, and attitudes of men and women; on 
the adult-child polarity, a subject is called 
upon to identify a subtle quality of thinking 
that is reflected in language usage. Such a task, 
requiring a discrimination between the thought 
characteristics of adults and children, may be 
especially difficult for the brain damaged 
group. 

At any rate, the performance of the brain 
damaged group on the adult-child polarity was 
so poor as to recommend the test as a little 
understood but highly interesting diagnostic 
tool. If 7 is taken as the cutoff point for each S, 
then three normals were misdiagnosed because 
of their failure to shift at least 7 points from 
adult to child score, whereas only two brain 
damaged Ss were misdiagnosed because they 
shifted more than 7 points. 

Implications for future research extend in 
several directions. The present study should be 
supplemented by testing the ability of female 
schizophrenics to take male and female roles. 
Theoretically, an underlying masculine identifi- 
cation would result in a poor performance on 
female role. A second study should deal with 
the adult-child polarity in children. It would be 
interesting to learn at what age a child follows 
the adult pattern of preference for abstract 
relationship in his free associations and in his 
taking the role of adult. This study would pro- 
vide a clue to the relationship between role 
perception and role enactment. Finally, the 
diagnostic implications of the various cog- 
nitive and role-taking tests employed herein 
merit further study. 


SUMMARY 

The purpose of this investigation was two- 
fold: to study the nature of role-taking per- 
formance in schizophrenic and brain damaged 
patients, and to contribute thereby to an 
understanding of thought pathology in these 
patient groups. Schizophrenic and nonpsy- 
chotic brain damaged Ss were administered 
four multiple-choice word association tests 
which required shifting from the male to fe- 
male role on one pair and from the adult to 
child role on the other. The latter, for which 
empirical norms were obtained, required an 
implicit discrimination between abstract and 
concrete relationships of stimulus and re- 


sponse words. The most relevant cognitive 
aspect of the role-taking performance was 
tapped by a task requiring the ability to in- 
hibit one’s preferred response in favor of 
another. 

When role-taking scores were adjusted for 
differences in this cognitive ability, the brain 
damaged group wys superior to the schizo- 
phrenic group on ‘the male-female polarity, 
chiefly because of the schizophrenics’ inability 
to take the male role. The brain damaged 
group failed completely on the adult-child 
polarity to discriminate the abstract from the 
concrete relationship with the result that the 
schizophrenics were superior in the shift from 
adult to child role. Both patient groups were 
deficient in comparison with normal controls 
on both role-taking polarities. Results were 
interpreted as indicating that lowered role- 
taking performances in schizophrenic and 
brain damaged groups were related to em- 
pathic and cognitive deficiencies respectively. 
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PRODUCTION OF ASSOCIATIVE SEQUENCES IN SCHIZOPHRENIA 
AND CHRONIC BRAIN SYNDROME“? 


JOHN R. LESTER 
Norwich State Hospital, Connecticut 


LTHOUGH Bleuler (1950, p. 32) was con- 
cerned with extreme time variations in 
the associative process as character- 

istic of pathological thinking, he had to ac- 
knowledge that, “We know almost nothing 
relative to the time relations of the schizo- 
phrenic associative process.” Recently Arieti 
(1955) has reaffirmed our ignorance on this 
score. 

In an effort to clarify Bleuler’s notion of 
time differences in association among nosologi- 
cal groups, this study applied a relatively new 
technique to examine the temporal character- 
istics of production under conditions of re- 
stricted association and of recall. The associ- 
ative productions of four groups (control, 
paranoid schizophrenic, hebephrenic schizo- 
phrenic, chronic brain syndrome epileptic) 
were considered within the framework of 
Hebbian Theory (1949). 

Bousfield, Sedgewick, and Cohen (1954), 
employing ncrmal subjects, developed a mathe- 
mathical method to describe verbal production 
in terms of analytical scores. The unique 
feature of this method is that it permits an 
analysis of the efficiency of the production 
process independent of the content of associ- 
ations. Where subjects are asked to name cities 
in the United States, for example, their re- 
sponses occur readily at first, but the rate of 
production diminishes with time. Thus, within 
the limits set by the instructions, S has a 
supply of associates, and the rate at which this 

1 This study is the condensed version of a disserta- 
tion submitted in partial fulfillment of the require- 
ments for the Ph.D. degree at the University of Con- 
necticut, June, 1957. 

The author is indebted to the members of his 
committee, Margaret M. Riggs, Weston A. Bousfield, 
James M. Sakoda, and Hermann O. Schmidt, for 
their encouragement and patient direction during the 
preparation of this study. In addition, the author 
wishes to thank Ronald H. Kettle, Superintendent, 
Norwich State Hospital, for his cooperative support; 
Jules D. Holzberg, Connecticut State Hospital, who 
graciously assisted in the selection of some of the sub- 
ects; and Sidney A. Orgel, Veterans Administration 
Hospital, Syracuse, for his helpful suggestions and 
for the loan of the apparatus 


2 


? 


supply is depleted is proportional at any given 
moment to the number of associates remaining 
in reserve (Bousfield et al., 1954). When 
familiarity of the items to be recalled is high, 
the production of associates virtually ceases 
within 10 min., and the empirically obtained 
production curve can be expressed by an hyper- 
bolic equation.* 

It was recently suggested that the hebe- 
phrenic manages adjustment by a form of 
cortical activity termed “intrinsic organiza- 
tion” (Orgel, 1956, p. 26). Intrinsic organiza- 
tion involves: a heightened limen for sensory 
stimulation; a loss of purposeful, persistently 
directed activity; a reduction in memory; a 
loss of superordination (Hebb, 1949, p. 122). 
Since superordination is weak, the hebephrenic 
should manifest a marked deficit in efficiency 
of production. Deficit should, therefore, appear 
not only in the form of a reduced supply of 
associates, but also in a slow rate of depletion. 

Similar predictions concerning the verbal 
productions of the epileptic should follow, for 
the behavioral correlates of intrinsic organiza- 
tion do seem to parallel the descriptive account 
of the epileptic’s behavior (Orgel, 1956). How- 
ever, one qualification is in order. It cannot be 
assumed that the psychopathology of all 
epileptics is as the simple consequence of 
cortical impairment. Functional factors, not 
unlike those in hebephrenia, can also be pres- 
ent. In some cases then, the efficiency of the 
epileptic’s production may be doubly limited. 
Group data should show a reduced supply of 
associates, and the rate of depletion of even 
the limited supply should be slightly slower 
than that of the hebephrenics and markedly 
slower than that of the controls. 

The behavioral correlates of intrinsic organi- 
zation do not appear to be present in paranoid 
schizophrenics who have been reported to be 
cont/l + cmét:n is the total number of words 
produced in time, #; ¢ is the asymptote approached 
by the function, rationally defined as the total number 
of items available for recall; m is a constant, the slope 
factor, the rate of depletion of the supply (Bousfield, 
Sedgwick, & Cohen, 1954, p. 116) 


in = 


5 
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(Rapaport, 1951; Berze, 1951), 
strongly motivated (Cameron, 1947), pos- 
sessed of keen memory (Bleuler, 1950), or 


without a loss of superordination (Orgel, 1956). 


hyperalert 


Rather, there are well-established phase 
sequences (thought changes) whose stability 
and resistance to modification testify to the 
domination of the delusional superordination. 
In Hebbian terms, well-developed phase se- 
quences depend for their existence on continued 
but mild inconsistencies in sensory “feedback.” 
Thus, the security operations of the paranoid 
eem to require constant, though distorted, 
interaction with others (Sullivan, 1953, p. 328). 
One may infer, therefore, that the association 
process of the paranoid schizophrenic is char- 
acterized by a minimum of disturbance in 
superordinate relationships, and a careful 
selectivity should be manifest in his associative 
sequences. Thus, though the supply of associ- 
ates should be reduced, the paranoid should 
approach the control group in his rate of 
depletion. 

As a working assumption, it was proposed 
that the rate of depletion reflects the amount 
of interference present during the selection of 
associates for production. The less the disrup- 
tion of the phase cycles, the more firmly estab- 
lished is the superordinate structure; the less 
the interference, the faster is the rate of de- 
pletion and the more temporally efficient is the 
associative process, once allowance has been 
made for differences among the groups in the 
available supply of associates (gross output). 


METHOD 

Subjects 

The Ss were: 30 paranoid schizophrenics and 30 
hebephrenic schizophrenics, chosen according to ob 
jective nosological criteria (Orgel, 1957); 30 chronic 
brain syndrome epileptics of unequivocal diagnoses,‘ 
and 30 controls with no known history of psychiatric 
disturbance. The four groups, each of 15 male and 
15 female Ss, were comparable for age, years of educa 
tion, Stanford-Binet vocabulary IQ (Cureton, 1954), 
and length of hospitalization (institutionalized groups). 
The relevant data are presented in Table 1. 

All 120 Ss were examined individually. Patients 
were tested in a side room on the wards in which they 


‘Since the supply of female patients in this category 
was limited, it was necessary to select eight of this 
group from the Connecticut State Hospital. All other 
institutionalized subjects were residents of the Norwich 
State Hospital 


TABLE 1 
NorMaTIve Date For EquatTep Groups 
(N = 30 in Each Group) 


Hebe 


Control | Paranoid phrenic Epilept 

Chronological Age 
Mean 39.5 42.8 40.4 38.6 
SD 6.9 5.2 7.4 6.4 
Range 29-50 | 33-50 | 24-50 | 26-48 

Years of Schooling 
Mean 10.3 10.0 9.7 9.1 
SD 1.9 2.3 2.4 3.0 
Range 7-14 7-19 6-16 6-16 


Stanford-Binet 
Vocabulary IQ 


Mean 116.2 (115.0 |110.8 (107.8 
SD 14.6 14.3 12.8 12.7 
Range 90-144; 91-145) 91-134; 92-130 


Months of Hospitali- 


zation 
Mean 135.9 (142.2 (119.3 
SD 74.0 72.2 67.8 
Range 34-320 45-302 33-352 


resided. Control Ss were examined in their homes 
After S had entered the examining room, he was given 
a paper and pencil and asked to write his name and 
the sentence, “I hope to leave here very soon” (Bat 

cock, 1930). When it was clear that he was able to 
comply with these instructions, he was presented with 
Task I. 

Task I. Following Orgel’s (1956) procedure, a 
single category (food) list of 40 two-syllable words 
was presented visually by memory drum. After three 
randomized presentations of the list, S was instructed 
to write down an many of the words as he could 
remember. E recorded the cumulative production at 
15-sec. intervals for a period of 10 min. 

Task II. After the termination of Task I, £ said 
“That was fine. Now I have something a little dif 
ferent for you to try.” E presented S with a sheet of 
lined paper and gave the following instructions: 

When I say, “Go,” I would like you to writ 
down the names of all the colors that you can 
think of. You will have ten minutes to finish. After 
the first few minutes everyone finds it hard to think 
of any more colors. If this happens to you, you 
need not tell me about it and don’t worry about it 

Just sit back and try to remember more colors 

Once we start, you are to write down as many names 

of colors as you can remember until I say, “Stop 

Are there any questions? 

When it was clear that the instructions were under 
stood, E said, “Ready? Go.” E recorded the cumulative 
production at each 15-sec. interval. Association was 
terminated after 10 min. 

Pilot investigation had indicated that initial asso 
ciations within the color category have a high degree 
of readiness, and the cumulative production curve 
virtually reached asymotote within ten minutes. This 
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pattern of response was consistent for both well edu- 
cated normals and psychotic Ss. 

On termination of Task II, Z administered Sections 
IIi and V of the Wechsler Memory Scale (Wechsler, 
1945). Section III, Mental Control, was selected 
because the nature of the controlled association re- 
quired is not unlike that demanded in Task II. Section 
V, Digit Span, was administered in standard fashion 
to obtain an independent index of immediate memory. 


Classification of Responses 


Bousfield (1954) in his method of analysis of the 
verbal productions of normal Ss classifies any written 
response as an instance of recall whether or not it 
actually corresponds to a word on the stimulus list 
In the present study, a modification of this approach 
was introduced to take account of the intrusions which 
occur frequently in the productions of psychotics. 
Often these intrusions were inappropriate to the de 
mands of the task: e. g., “saxophone” as part of the 
recall of a previously presented stimulus list of food 
words. Thus, for both Tasks I and II, analyses were 
performed at two levels: 

Level 1: Relaxed Standard: To cover all written 
responses, including irrelevant intrusions, neol 
ogisms, perseverations, etc. 

Level 2: Rigid Standard: To apply only to the first 
instance of each response which complies specifi- 
cally with the task requirements 

Therefore, in Task I, Level 2, only the first occur- 
rence of words which appeared on the stimulus list 
was credited. In Task II, Level 2, only the first oc- 
currence of a word which actually appeared on either 
of two standard lists of color names (Kem-tone Color 
Charts;> Standard Color Card of America®) was 
acceptable. In all instances, where a response could 
not be found on the standard list of color names, 
Webster’s Collegiate Dictionary (1937) was consulted 
as a final authority. If the dictionary definition of the 
questionable word included no refegence to color, the 
response was classified as irrelevant. 


RESULTS 

The first step in analysis involved the com- 
putation of the cumulative totals of responses 
at the end of the first minute of production 
and at each successive 2-min. interval for each 
S on Tasks I and II, Level 1 and 2. The means 
and standard deviations of the cumulative 
totals of associates produced for the four 
nosological groups are presented in Tables 2 
and 3. A summary of the results based on the 
average curves of the four nosological groups, 
Tasks I and II, Levels 1 and 2, appears in 
Table 4. 


5 Kem-tone Commercial Color Charts. Detroit: 
Rogers Co., 1956. 

* The Textile Color Card Association of the United 
States, Inc. Standard color card of America (9th ed.) 


New York: 1941. 


Total Product 


For the analysis dealing with total product, 
both in the recall of a previously presented 
stimulus list (Task I) and under the condition 
of minimally restricted association (Task II), 
at the two levels of analyses, significant differ- 
ences in productivity occurred among the 
groups (p < .001 in each of the four analyses 
of variance). Group means followed the pre- 
dicted order—control, paranoid, hebephrenic, 
and epileptic in three of the four conditions. 
The one exception appeared in Task I, Level 2, 
where the epileptics produced more acceptable 
associates than the hebephrenics. Between- 
groups ¢ tests revealed that the control group 
consistently differed from the patient groups in 
total production on Tasks I and II at both 
levels of analysis; among the patient groups, no 
other significant differences were obtained. 

On the independent measures, Digit Span 
and Mental Control scores, the results followed 
the same pattern as the findings for Tasks I and 
II, i.e., the ordering of group means was in the 
same sequence. Only the controls differed 
significantly from the other groups; among the 
patient significant differences 
appeared. 


groups, no 


Temporal Characteristics of the Association 

Process 

Application of the equation for the hyper- 
bola to each individual cumulative curve of 
production yielded score distributions for each 
group. Thus, for every S, values were obtained 
for c, m, and t% (cf., Fn. 3) for the two tasks, 
each at two levels of analysis. 

Bousfield (1954) defined m as an inverse 
measure of interference, i.e., other things equal, 
the greater the magnitude of m, the less fre- 
quently are associations interfering with each 
other during production, and the faster is the 
rate of depletion. There is the implication of 
“interference” or “friction” between com- 
peting associates which occurs as S successively 
selects and rejects verbal concepts for appropri- 
ateness of response. Selection consumes time; 
thus, with supply constant, the more smoothly 
the selection proceeds, the greater is the magni- 
tude of m, and the faster is the rate of de- 
pletion. 

Differences among the groups in rate of de- 
pletion can be meaningfully evaluated only 
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TABLE 2 

Means anp SD or Torats OF Worps PropuceD As A FUNCTION OF Time SPENT IN PRODUCTION FOR THE Four 
NosoLocicaL Groups: Task I, Lever 1 (m) anp LeveL 2 (n — 4) 

(N = 30 in Each Group) 


Time (in Min.) After Starting Signal 








Group i 2 4 5 8 10 

M SD M SD M SD M SD M SD M SD 

Level 1 Control 11.57 | 2.78 | 15.33 | 3.66 | 18.07 | 4.26 | 19.70 |5.09/21.13)5.15|21.70) 5.32 
n) Paranoid 8.03 | 2.50 | 10.03 | 3.51 | 12.60 | 4.30 | 14.00 |4.77|15.23)4.80)15.87| 5.10 
Hebephrenic 6.53 | 3.41 8.80 | 3.92 | 11.10 | 4.90 | 12.53 |5.32/13.90'6.19)15.47| 7.38 

Epileptic 5.53 | 2.35 | 8.67 | 3.41 | 11.17 | 4.15 | 12.63 |4.55/13.73'5.01/14.53) 5.38 
Level 2 Control 11.53 | 2.81 | 15.23 | 3.74 | 17.70 | 4.16 | 19.57 |5.03\20.40/5.11/20.93) 5.26 
(m — #) | Paranoid 7.63 | 2.35 | 9.60 | 3.43 | 12.10 | 4.07 | 12.97 |4.76)13.57/4.88)14.03) 5.13 
Hebephrenic 6.20 | 3.54 7.97 | 4.12 9.57 | 4.75 | 10.37 |5.03)11.03\5.25)11.47| 5.46 
Epileptic 5.30 | 2.49 | 8.27 | 3.55 | 10.60 | 4.22 | 11.80 |4.59)12.604.89)13.03) 5.09 


TABLE 3 
Means ano SD or Torats oF Worps PRODUCED AS A FUNCTION OF Time SPENT IN PRODUCTION FOR THE Four 
Noso.ocicat Groups: Task II, Levert 1 (m) ANp Levet 2 (nm — 4) 
(N = 30 in Each Group) 


Time (in Min.) After Starting Signal 


Group i 2 4 6 8 10 

Ml SD Mu SD M | SD M SD| M |SD\| M SD 

Level 1 Control 11.80 | 2.39 | 16.30 | 3.53 | 21.37 | 5.56 | 25.37 |7.13)/28.40/9.13|30.93/10.44 
(m) Paranoid 9.53 | 1.71 | 11.83 | 2.42 | 15.30 | 3.73 | 17.17 |4.37|18.33)5.13)19.33) 5.74 
Hebephrenic 7.73 | 3.09 | 10.43 | 3.80 | 13.63 | 5.48 | 15.33 6.70:16.40)7.34)17.77| 8.26 

Epileptic 7.27 | 3.09 | 10.00 | 3.64 | 12.60 | 3.7 14.27 |4.41/16.17|5.25)17.43) 5.85 

Level 2 Control 11.77 | 2.33 | 16.23 | 3.36 | 21.07 | 5.18 | 24.93 |6.58/27.93|8.66|30.40) 9.96 
(mn — ¢) | Paranoid 9.50 | 1.74 | 11.80 | 2.41 | 15.03 | 3.32 | 16.80 |3.9417.83'4.42)18.57) 4.74 
Hebephreni 7.70 | 2.99 | 10.17 | 3.66 | 13.17 | 5.25 | 14.73 |6.47/15.63/7.05/16.67| 7.90 

Epileptic 7.23 | 3.08 9.63 | 3.28 | 11.90 | 3.56 | 13.00 |3.85)14.33'4.42)15.03) 5.00 


TABLE 4 
Derrvep CONSTANTS AND HALFr-Lives ror RATIONAL EQuaTION Fitrep To Grove Data For Four NOSOLocIcaAl 
Groups: Tasks I anp II, LEvets 1 (m#) AND 2 (nm — i) 


¢c m th 
e yen i = " < mat nn a ie n 7 ot ong 
Task I Control 24.18 23.27 .0340 .0381 1.22 2.83 
Paranoid 18.18 15.55 .0346 .0561 1.59 1.15 
Hebephreni 18.01 12.69 .0255 .0659 2.18 1.20 
Epileptic 17.48 15.38 .0265 0362 2.16 1.80 
Task I Control 38.81 37 .86 .0090 0095 2.87 2.78 
Paranoid 22.28 21.28 .0267 0306 1.68 1.54 
Hebephreni 20.72 19.25 .0248 0298 1.95 1.74 
Epileptic 20.72 17.08 .0214 0370 2.26 1.58 


after the influence of group differences in pendent variable and m as the dependent 
available supply have been controlled. For variable. The results of the four analyses of 
Tasks I and II, Levels 1 and 2, analyses of covariance were as follows: Task I, Level 1, 
covariance were performed with c as the inde- F = 11.78, p < .001; Task I, Level 2, F = 8.30, 
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TABLE 5 
UNADJUSTED MEAN INDICES OF INTERFERENCE AND MEAN INDICES OF INTERFERENCE ADJUSTED FOR DIFFERENCES 
IN THEORETICAL SupPPLY OF AssocIATEs: Tasks I AND II, LEVELS 1 AND 2 


Task I Task II 

Level i 2 ys aa c 

Mean Adj. =p Unadj Unadj Adj. Unadj * "Adj. Unadj. - 
Control 0605 .0356 .0861 .0422 .0323 .0124 .0387 .0130 
Paranoid 0355 .0363 .0599 .0703 .0273 .0326 .0329 .0352 
Hebephrenic .0261 .0325 0452 .0689 0223 .0351 .0274 .0421 
Epileptic .0237 0289 .0329 .0374 .0177 .0244 .0260 .0450 

TABLE 6 


p < .001; Task II, Level 1, F = 3.82, p < .05; 
Task II, Level 2, F = 1.38, ns. The adjusted 
and unadjusted mean indices of interference 
appear in Table 5. For Task I, Levels 1 and 2, 
and Task IT, Level 1, six analyses of covariance 
compare the groups two at a time. The Fs ob- 
tained by these analyses are presented in 
Table 6. 

These findings indicate that differentiation 
between groups, with regard to the amount of 
interference manifest, increased directly with 
the number of the restrictions placed on associ- 
ation. Under the condition approaching free 
association, if inappropriate responses were 
ignored (Task II, Level 2), no differences were 
evident in interference. As increased restric- 
tions were imposed on the association process, 
a continuum of deficit in the ease of synchroni- 
zation of appeared. Maximum 
differentiation among groups was achieved in 
Task I, Level 1. The continuum which emerged 
as an important result of the four analyses is 


associates 


presented in Table 7. 


Efficiency of Production 


Efficiency (t}4), as here employed, is not a 
measure of simple rate of production of associ- 
ates. Rather, it is based on the relationship 
between the supply of items available for pro- 
duction and the amount of interference pres- 
ent during production, i.e., 444 = 1/cm. Since 
c and m are highly correlated, m was chosen as 
the independent variable for covariance 
analysis, not because it is more effective than 
c in exerting contro] on (4%, but because both 
n and t% are traditionally empirical measures 
of production and efficiency respectively 
(Bousfield, Sedgewick, Cohen, 1954; Johnson 
1955). 


The results of the four analyses of covariance 


Fs OBTAINED IN TESTS OF DIFFERENCES BETWEEN 
ADJUSTED MEAN INTERFERENCE INDICES FOR SIX 
SETs OF COMPARISONS BETWEEN NOSOLOGICAL 
Groups: Task I, LEVELs 1 AND 2; Task 


II. Lever 1 
Task I | Task 1 
Groups — 
Level 1 Level 2 Level 1 
Control vs. Paranoid 13.45* | 3.50 3.25 
Control vs. Hebephrenic | 26. 16* 8.43> 2.08 
Control vs. Epileptic 31.38 | 19.76° | 10.36 
Paranoid vs. Hebephrenic| 3.64 2.18 1.15 
Paranoid vs. Epileptic 5.12° | 7.8@ 8.22» 
Hebephrenic vs. Epileptic .28 9.32» 3.99 
a p< .0S 
> p< 01 
°p < .001 
(n = X; t% = FY) are: Task I, Level 1, 


F = 7.45, p < .001; Task I, Level 2, F = 4.33, 
p < .01; Task II, Level 1, F = 1.93, ns; Task 
II, Level 2, F = 0.71, ms. The adjusted and 
unadjusted mean indices of efficiency appear 
in Table 8. For Task I, Levels 1 and 2, six 
separate analyses compare the groups two at a 
time. Fs obtained by these analyses are pre- 
sented in Table 9. 

For Task I, Level 1, the controls proved to 
be more efficient than all patient groups, after 
the influence of sheer output of associates was 
removed. At this level of analysis, the paranoids 
surpassed the epileptics in efficiency. The hebe- 
phrenic group lay between the paranoid and 
epileptic groups, but differed significantly from 
neither. 

In Task I, Level 2, after intrusions were re- 
moved, the control vs. epileptic and paranoid 
vs. epileptic comparisons revealed the only 
significant differences. The fact that the hebe- 
phrenic and paranoid groups no longer differed 
from the control group at this level of analysis, 
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TABLE 7 
PATIENT GROUPS WHICH MANIFEST A SIGNIFICANTLY 
GREATER AMOUNT OF INTERFERENCE WHEN 
COMPARED WITH THE CONTROL GROUP 
AS A FUNCTION OF THE TASK AND 
LEVEL OF ANALYSIS 


Task Level Groups 
I 1 Paranoid, Hebephrenic, 
Epileptic 
2 Hebephrenic, Epileptic 
Il I Epileptic 
2 None 
TABLE 8 


UNADJUSTED MEAN HALF-LIves AND MEAN 
Hatr-Lives ADJUSTED FOR DIFFERENCES 
tn Propuction: Tasks I anp II, 


LEVELS 1 AND 2 
Task I Task II 
Level 1 Level 2 Level 1 Level 2 
2 i5s/2/5/;/2;/5/2/5 
Control 92 1.19 91, 1.08 1.73 | 2.42 | 1.44 | 2.29 
Paranoid 1.50 1.37 8 or 1.50 1.44) 1.38 | 1.36 
Hebephreni 1.99 1.84 1.34; 1.22 1.78 1.51 1.67 1.37 
Epilepti: 2.31) 2.03 | 1.86) 1.77 | 2.33 | 2.04 | 1.72 | 1.34 


TABLE 9 
Fs OBTAINED IN Tests OF DIFFERENCES BETWEEN 
ADJUSTED MEAN HALr-Lives For Srx SETS OF 
COMPARISONS BETWEEN Four NOSOLOGICAL 
Groups: Task I, LEVEts 1 AND 2 


Groups Level 1 Leve 

Control vs. Paranoid 5.208 1.54 
Control vs. Hebephreni 4.65" 1.36 
Control vs. Epilepti« 16. 30° 8.22» 
Paranoid vs. Hebephreni 1.54 1.94 
Paranoid vs. Epileptic 4.16* 8.07 
Hebephrenic vs. Epileptic 46 2.76 

a p< os 

b ; 01 

p « wl 


as was the case at Level 1, suggests that it is 
the presence of intrusions which accounts for a 
significant reduction in efficiency in both the 
hebephrenic and paranoid groups at Level 1. 
Because /44 depends directly on m for its 
power of differentiation, ¢44 is best considered 
as a summary score. The value of /4 as a 
measure lies in its ability to express a complex 
interaction between interference and the theo- 


TABLE 10 
NUMBER OF SUBJECTS IN Eacu Group (N = 30) 
Wro Propuce ONE OR More [IRRELEVANT 
Intrusions (Tasks I anp IT) 


Irrelevant Tasks Categorical Task 
I II I 


Control 0 6 6 
Paranoid 3 4 16 
Hebephrenic 10 10 15 


Epileptic s 10 10 


retical supply in units of time. However, un- 
adjusted half-lives tend to be misleading (cf., 
ordering of unadjusted mean half-lives vs. order 
of adjusted mean half-lives for all groups, 
Table 8). Thus, it is only after adjustment for 
differences in output that half-lives provide 
meaningful comparisons when applied to groups 
grossly differing in volume of production. 


Intrusions 

Separate chi square analyses were performed 
for the number of Ss in each group having no 
irrelevant intrusions vs. the number having one 
or more irrelevant intrusions (Tasks I and II), 
and for the number of Ss giving no categorical 
intrusions vs. the number giving one or more 
categorical intrusions (food words not on the 
list) in Task I. The data for the production of 
intrusions in the three situations is presented 
in Table 10. 

The over-all chi square for the number of Ss 
who gave one or more irrelevant intrusions 
were: Task I, 14.87; Task II, 13.34 (which in 
both cases for 3 df are significant beyond the 
.01 level). In Tasks I and II chi square com- 
parisons between the groups, two at a time 
(Yates correction 1 df), yielded the values pre- 
sented in Table 11. 

The over-all chi square for the number of Ss 
who produced one or more categorical in- 
trusions in Task I was 9.07, 3 df, p < .05. In 
chi square comparisons between groups, two 
at a time (Yates correction 1 df), significant 
differences were as follows: control vs. para- 
noid, » < .02; and control vs. hebephrenic, 
p < .05. 

In terms of percentage of the total product 
in Task I, the hebephrenic and paranoid groups 
manifested a stronger tendency to give cate- 
gorical intrusions than appeared in the control 
and epileptic groups. Perseverations made up a 
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TABLE 11 
p VALUES FOR CHI-SQUARE COMPARISONS BETWEEN 
Groups ON NuMBER OF SuBJEcTS WHO GIVE 
ONE OR MORE IRRELEVANT INTRUSIONS 
(Tasks I anv II) 


Para- Hebe- 


Control noid phrenic Epileptic 
Epileptic 05 ns ms 
Hebeph i 01 10 ns : 
Task I a th. Task II 
Paranoid ns ms 01 
Control ns ns .02 


relatively large proportion of the intrusions 
produced by epileptics, particularly in Task II. 

For all groups in both tasks, the absolute 
number of intrusions did not increase ap- 
preciably with time from the starting signal. 
However, there was a general tendency for all 
groups to give a higher percentage of intrusions 
as the output of appropriate associates de- 
creased over time. On Task I, this effect in the 
hebephrenic group was quite marked; in the 
8-10 min. interval almost 75% of the responses 
produced were words other than those which 
appeared on the original stimulus list. In 
Task II, it was the epileptics who showed the 
strongest tendency to give an increasingly 
higher percentage of irrelevant and persevera- 
tive intrusions, as time from the starting signal 
increased. 

The findings with regard to intrusions can 
be related to the earlier results on group differ- 
ences in interference and efficiency. Groups 
which introduce the greater number of in- 
trusions toward the end of the production 
period also show a greater relative improve- 
ment (decrease in interference, increase in 
efficiency) when intrusions are removed, be- 
cause, with intrusions eliminated the production 
curves of these groups tend to reach asymptote 
more quickly. 

Nevertheless, the results of the analyses did 
reveal significant differences among the groups 
even after the intrusions were removed. Thus, 
the intrusions do not by themselves account 
for all differences among the groups on the 
measure of interference and efficiency. 

DISCUSSION 

While the temporal characteristics of the 
production process defined the general area of 
interest in this study, it should be apparent 
that it is the interference constant, m, which 
stands as the most significant index. 


Two variables, interference (m) and theo- 
retically available supply (c), were derived 
from the cumulative production curve as 
separate aspects of the same data, and they 
shared considerable common variance (of the 
order r = .80). Yet when the variance in m 
common to ¢ was removed, the residual 
variance in m as a measure of interference pro- 
vided an excellent basis for differentiation 
among the groups. 

The controls were found to differ from the 
patient groups with respect to gross output of 
words under the four conditions of measure- 
ment, confirming earlier studies reporting a 
greater total output for normals as compared 
to patient groups. However, where the focus 
shifted to group differences in rate of response 
(after allowance was made for difference in 
supply), differentiation between the groups 
clearly improved. 

Since it was the hebephrenic and epileptic 
groups who manifest the behavioral correlates 
of intrinsic organization, as opposed to the 
paranoids who do not, the prediction was made 
that the hebephrenics and epileptics would 
reveal more evidence of disturbance in super- 
ordination than the paranoids or controls. The 
results, in general, tend to support this position 
derived from Hebb. It was the epileptics and 
hebephrenics who showed considerable inter- 
ference in the selection of associates, which 
appeared as a slowing in the flow of the avail- 
able responses. On the grounds that paranoids 
are characterized by a “‘lasting modification of 
thought,” which, however, does not imply im- 
paired superordination, it was predicted that 
the paranoids would approach the controls with 
respect to efficiency of superordination. The 
paranoids did approach the controls, and both 
these groups revealed much less interference 
in their associative sequences than the hebe- 
phrenics and epileptics. 

In terms of the amount of interference, once 
allowance had been made for differences in the 
available supply, the predicted ordering from 
most to least (epileptic, hebephrenic, paranoid, 
control) was consistently confirmed for the 
recall situation (Task I) and in the more 
general condition of restricted association 
(Task II), whether the appropriateness of the 
responses (Level 1 vs. 2) was considered or not. 

In this experiment, S, in order to perform 
efficiently, must adopt one set at a time, i.e., 








232 Joun R. LESTER 


(Task I) name only foods on the stimulus list 
or (Task II) name only colors. Thus, where 
intrusions occur, there is behavioral evidence 
that interfering systems have invaded the 
leading system (Angyal, 1941, p. 309). The 
patient groups, particularly the hebephrenics 
and epileptics, seemed unable to resist the in- 
vasion of competing systems (quite possibly 
the systems associated with the emotional com- 
plexes, as Jung [White, 1944] suggested). On 
the other hand, intrusions seem to be but 
symptoms of the underlying weakening of 
integration, and they project above the surface 
like the top of an iceberg. More significant for 
an understanding of deviant thought is the fact 
that these interfering systems, of which in- 
trusions are only obvious signals, “‘continue to 
pound at the leading systems” (Angyal, 1941) 
and create resistance or friction which slows 
down the thought process and appears in this 
experiment as a decrease in the magnitude of 
the slope constant, m (a slowing in the rate of 
depletion). The slowing effect was reduced for 
all groups when intrusions were ignored; but 
even then significant slowing was still de- 
tectable in the pathological groups, and differ- 
entiation among some of the groups was still 
possible with this measure. 

The half-life, as a measure of efficiency, 
proved to be effective in ordering the groups, 
once an allowance was made for differences in 
volume of output. Babcock (1930, p. 42) com- 
mented on the fact that individuals who have 
a limited supply, such as the cautious para- 
noids, may be efficient in terms of the speed of 
their associations. Babcock’s position is sup- 
ported by present findings, particularly in 
Task II, where the paranoids produced signifi- 
cantly fewer responses than the control but 
still proved to be as efficient. Consistent with 
Babcock’s position, Bousfield, Sedgewick, & 
Cohen (1954) cautioned that efficiency must 
depend not only on the size of the supply but 
also on the amount of interference. 

Heretofore, the problem of precise and ap- 
propriate measurement of speed of verbal re- 
sponse has been difficult, since only gross 
measures of total production were employed. 
As demonstrated in this experiment, the ana- 
lytical technique developed by Bousfield, 
Sedgewick, & Cohen (1954) contributes a new 
dimension to such analyses, taking into ac- 


count temporal aspects of the process of pro- 
duction as well as the end product. In the last 
analysis, these results are in the nature of 
empirical observations which do not contradict 
previous findings in the general area of associ- 
ative production. Rather, they serve to illumi- 
nate the interpretation of other empirical 
findings by increasing our understanding of 
certain previously unexplored deviations in the 
thought processes of the groups concerned. 


SUMMARY 


This study was concerned with timing differ- 
ences in the associative processes of schizo- 
phrenics and epileptics. A recall situation and a 
more general condition of restricted association 
were employed to elicit verbal sequences which 
were analyzed in terms of efficiency of the pro- 
duction. 

The controls were found to differ from the 
patient groups with respect to gross output of 
words, but where the analysis took account of 
the temporal aspects of the process of pro- 
duction, differentiation between groups clearly 
improved. As increased restrictions were im- 
posed on the association process, disturbances 
in the thinking of the pathological groups be- 
came more apparent. 

The results suggest that inappropriate re- 
sponses (intrusions), which appear in the pro- 
ductions of psychotic individuals, are but 
symptomatic of a more basic disturbance in 
thought processes. The interference measure 
employed in this study seems to provide a 
more direct index of the degree of thought dis- 
organization. 

The assumption that interference, resulting 
from the pressure of extraneous associations, 
hampers associative production by slowing the 
rate of responding is discussed within the 
framework of Hebbian theory and in relation to 
other theoretical formulations. 
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VERBAL AND GSR RESPONSES IN EXPERIMENTAL INTERVIEWS 
AS A FUNCTION OF THREE DEGREES OF “THERAPIST” 
COMMUNICATION 


BARCLAY MARTIN, RICHARD M. LUNDY, anp MARK H. LEWIN! 


University of Wisconsin 


HE present study deals with the tendency 

of a subject in an experimental psycho- 

therapy interview to approach or avoid 
coming to grips with therapeutically significant 
material as a function of the amount of com- 
munication he receives from the therapist. 
The process of psychotherapy is formulated in 
terms of an approach-avoidance model similar 
to the conceptions of Miller (1944) and Murray 
and Berkun (1955), in which it is conceived 
as a situation where the client alternately 
approaches significant material and retreats 
from it. No single motive is postulated to 
account for the approach behavior, but rather 
several motives that vary in nature from client 
to client. Thus, the long range goal of relief 
from psychological distress may motivate a 
person to endure the momentary 
in discomfort usually associated with approach 
behavior. There more immediate 
motives, such as a desire to please and gain 
approval from the therapist, a desire to obtain 
advice and reassurance, etc. Avoidance 
havior, on the other hand, is motivated by the 
anxiety, embarrassment, or (more generally) 
the negative affect associated with approach- 
ing therapeutically significant material. It is 
postulated that, as a client alternately ap- 
proaches and retreats from such material and 
experiences the resultant negative affect in a 
therapeutic atmosphere of noncritical accept- 
the negative affect 


increases 


are also 


be - 


ance, understanding, etc. 
associated with these sensitive areas tends to 
extinguish. It is also assumed that this extinc- 
tion of negative affect generalizes to some ex- 
tent beyond the therapy session, and that, 
accordingly, this process is in part responsible 
for what is therapeutic about psychotherapy. 

A question that naturally follows from these 
considerations is, ‘What are some of the fac- 

' The authors wish to thank Irene Waskow, Vilma 
Ginsberg, and Jack Hokanson for their helpful assist- 
ance on this study, and the University of Wisconsin 
Graduate School for providing financial support for 
research leave for one of the authors 
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tors which facilitate or impede approach 
behavior?” The factor that was introduced 
experimentally into the present study was the 
amount of communication or feedback that the 
S received from the therapist. Three groups 
were employed that were thought to represent 
points along this dimension. In one group, the 
S talked to a tape recorder with no therapist 


present, and accordingly received virtually no 


communication from the therapist. In a second 
group, the therapist was present but responded 
by nonverbal means only. In the third group, 
the therapist responded in an essentially 
client-centered fashion, thus providing a maxi- 
mum of communicative feedback compared 
with the other two treatment groups. 

It was thought that an S in Group 1 who 
received no communication from the therapist 
when he engaged in approach behavior would 
soon extinguish and show an increasing tend- 
ency to avoid therapeutically significant 
material. This was predicted because there 
would be no positive reinforcement from the 
nonpresent therapist for approach attempts, 
nor would the S have any way of knowing to 
what extent the nonpresent therapist might be 
implicitly responding in a critical, judgmental 
fashion. The nonverbal therapist condition, 
by providing some communication, was ex- 
pected to result in more approach behavior 
than in Group 1, but less than in the “regular” 
therapy condition of Group 3. 

These expected relationships were to be 
analyzed in terms of approach and avoidance 
behavior trends over the five experimental 
sessions. Thus, the research can be viewed in a 
more restricted sense as a study of the rein- 
forcing effects of three degrees of experimenter 
communication upon affectively toned verbil- 
izations in the Ss. In addition, changes in ten- 
sion or anxiety level, both within and between 
sessions, were assessed by continuous GSR 
recordings. Since various alternative predic- 
tions with respect to increasing or decreasing 
tension levels among the treatment groups 
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seemed reasonable, no specific hypothesis was 
proposed with respect to this part of the study. 


METHOD 

Subjects 

The Forced Choice Manifest Anxiety Scale 
Heineman, 1953) was given to the 240 students in a 
course on the Psychology of Adjustment. At the same 
time, the students were asked to indicate on their IBM 
answer sheets if they would be interested in partici- 
pating in a brief psychotherapy experience as part of 
a research project. From the students who volunteered, 
(about 80% of the class) the 44 who scored highest 
on the anxiety scale were contacted for 15-min. pre 
liminary interviews. At the end of each interview, the 
interviewer rated S on a 5-point scale of severity of 
disturbance and a 5-point scale of defensiveness. No 
Ss were eliminated because of the possible eminence of 
a psychotic break; one S was eliminated because she 
was already on the waiting list at another clinic. None 
of the Ss had experienced psychotherapy within the 
last two years 

The first step was to select the 27 Ss who were rela- 
tively high on disturbance and low on defensiveness. 
Next, Ss were divided into groups of three, with the 3 
Ss in each group matched as well as possible on their 
disturbance and defensiveness ratings. Within each 
triad, they were then randomly assigned to one of the 
three major treatment groups. For each treatment group, 
5 Ss were assigned to therapist A, 3 to therapist B, and 
1 to therapist C. Ten Ss were male, 17 female 


The Treatment Procedures 


Tape recordings and continuous GSR recordings 
were obtained for all Ss in all sessions, including the 
preliminary session. The GSR apparatus consisted of 
an ac circuit designed by Grant (1946) which fed into 
a G.E. Photoelectric Recorder 
were attached to the palmar surfaces of two of S’s left 
hand fingers, which rested comfortably on a low table 
beside the S. The GSR and tape recordings were syn- 
chronized by a timer which operated a marker pen on 
the GSR record and simultaneously made an audible 
click in the tape recorder every 37 sec.? 

Ss were seen for five 30-min. sessions one week apart. 
At the beginning of the first experimental session, Ss 
were given instructions that varied with the treatment 


Dry silver electrodes 


condition as described below: 

Tape Recorder Group. We are asking the subjects in 
different parts of the study to do different things. 
As you know, we are interested in studying various 
factors which affect the process of psychotherapy, 
and in one group, the group you are in, we want to 
study the effects of not having a therapist present 
in the room at all. We would like for you to imagine 
that you are a client in psychotherapy, but instead 
of talking to a therapist we want you to talk into 


? The 37-sec. interval has no particular significance. 
By mistake, the timer was set for this interval rather 
than 30 sec., and had to be continued at this interval 
throughout the study. 
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this tape recorder. This may seem a little strange, 
but try to talk about the things that you would ex- 
pect to in a regular therapy session. People ordinarily 
talk about things that are emotionally important to 
them and how they get along with the important 
people in their lives. But in general, feel free to talk 
about whatever comes to mind. There may be times 
when nothing comes to mind, and that’s O.K. too. 
You can just wait until something does, and then 
talk about that. In general, though, try to keep talk- 
ing as much as you can. After each session I will 
listen to the tape recording of your session. I will 
not talk with you about the session, though. I will 
just start you going each time, but will not talk with 

you about what you say on the tapes, although I 

will be listening to them after each session. 

Nonverbal Group. These instructions were the same 
except for the following modification: 

We want to study the effects of having you talk 
to a therapist who remains silent. So that, as the 
therapist in this situation, I will be listening care- 
fully to what you have to say, but will respond by 
nonverbal means only. In other words, I will indicate 
that I understand what you are saying by nodding 
my head, or saying ‘uh-huh’, etc., but I will not be 
able to say anything out loud. 

Regular Group. These instructions were similar to 
the above except that the S was told that the therapist 
would be responding as he would in ordinary psycho- 
therapy. 

Although outcome assessment was not the primary 
focus of the study, a forced-choice adjective check list, 
hereafter referred to as the Feeling Inventory, was 
obtained on all Ss after the preliminary interview and 
again at the end of the fifth session. This inventory con- 
sisted of 41 sets of three adjectives each with one ad- 
jective in each set representing a tension or anxiety 
dimension. S is asked to indicate which adjective is 
most and which is least like the way he feels. Some evi- 
dence for the sensitivity of this inventory to changes 
in anxiety level had been previously established 

Martin, 1959) 

Ss were instructed to answer the Feeling Inventory 
before the experimental sessions in terms of how they 
had been feeling during the past month. They were 
instructed at the end of the sessions to answer in terms 
of how they had been feeling during the last several 
days. A fourth group of nine Ss who were matched 
roughly on initial score on this inventory with the 
other groups, but who did not attend any experimental 
sessions, were retested after the same amount of time 
had elapsed. 


The Content Ratings of the Interviews 


The following rating scheme was developed after 
considerable trial-and-error practice on other tapes. 
Each 74-sec. interval of an interview was rated in- 
dividually for two major categories of behavior: Ap- 
proach and Avoidance. If approach behavior occurred 
during an interval, a check was placed under the Ap- 
proach column on the rating sheet; if avoidance 
behavior also occurred during the same interval, a 
check was placed in the Avoidance column. 
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In addition to this rather arbitrary time interval 
and the dichotomous scoring of approach-avoidance 
behavior, an additional rating of the extent of approach 
behavior based on more meaningful units of behavior 
was made. To this end, a four-point scale was developed 
for assessing degree of approach to therapeutically 
meaningful material. Raters were required to classify 
the approach behaviors checked under the Approach 
Column according to meaningful themes and to make a 
rating on the four-point scale for each of these “meaning 
units.” 

Some examples of the kinds of behavior representing 
avoidance and approach behavior at the four levels of 
the rating scale are given below. An attempt was made 
to make the four-point scale appropriate to the range 
approach behaviors expected in this particular study 
and sample, and it is likely that some modifications 
would be necessary before applying this scheme to the 
process of longer, regular psychotherapy 

Avoidance Behavior. General discussion of course 
work, campus activities, and other ordinary school- 
related topics without relation to thir own personal- 
emotional conflicts; abstract philosophical or psycho- 
logical discussion; description of other people and their 
personalities for at least one 37 sec. interval without 
relating this to their own problems; description of only 
positive or success experiences for at least one 37 sec. 
interval; any blanket and categorical denial of negative 
feelings with respect to a given person or situation. 

1 pproach behavior, Level 1. In general, indications of 
approach were inferred both from signs of increasing 
negative affect resulting from the approach, and signs 
of verbal approach based upon the raters’ preconcep- 
tions about therapeutically significant material. The 
following are examples of Level 1 approach: (a) talks 
in generalities about self, (6) describes interpersonal 
relationships including those with parents; (c) describes 
historical background material; (d) no feeling or strong 
personal involvement is expressed; (e) may specify 
negative feelings, but qualifies with instances of posi- 
tive or success experience and does not indicate strong 
conflicts. 

Approach behavior, Level 2. Describes clear cut psy- 
chological “symptom” or problem area, for example: 
(a) impotency, nervous vomiting, or specific situations 
in which handicapped by shyness, anxiety, hostility, 
etc.; (6) does not necessarily experience much feeling 
while describing symptom; (c) employs less qualifica 
tion and less emphasis on positive or success experi- 
ences than for Level 1; (d) problem may be externalized 
to large extent. 

A ppreach behavior, Level 3. Considers interpersonal 
aspects of symptoms or problem with some specificity 
accompanied by feeling; this feeling may relate to emo- 
tional reactions associated with the particular conflict 
area or may relate to feelings of discomfort and anxiety 
about going into the area; highly charged expression 
of feeling without much content exploration also scored 
here. 

Approach behavior, Level 4. Shows strong, une- 
quivocal expression and experiencing of feeling asso- 
ciated with approach to conflict area with considerable 
specificity. Level 4 represents an extension and intensi- 
fication of Level 3 behavior 


The 27 Ss were assigned in groups of nine to each of 
three judges for rating purposes. Two of these judges 
were therapists in the research and were assigned Ss 
they had not seen in the interviews. Whenever a judge 
rated an interview, another person set the tape recorder 
up for him so that he would not know which interview 
in the sequence was being rated. If there were initial 
comments in the interview that would provide a clue 
as to the sequence of the interview, the tape was set so 
that the rating began after this point. The first, second, 
fourth, and fifth interviews were rated with ail those 
for a given S done at one time. It was not possible to 
keep the raters blind with respect to the treatment con- 
dition, but they were blind with respect to sequence. 

Reliability estimates of the scores obtained from 
these ratings were obtained as follows. Judges A and B 
jointly, but independently, rated interviews for five Ss 
selected randomly; judges B and C, likewise, rated five 
other Ss. The three basic scores used were: (¢) Meaning 
Unit Rating, based upon the products of meaning unit 
rating (1-4), times the number of the 74-sec. intervals 
involved in the meaning unit and summed for al! mean- 
ing units for a given interview; (6) Approach Behavior, 
the number of intervals during which approach behavior 
was checked in an interview; (c) Avoidance Behavior, 
the number of intervals during which avoidance be- 
havior was checked in an interview. A fourth score, 
change in Meaning Unit rating from the first half to 
the last half of each interview, was also obtained. As an 
estimate of reliability, the three basic scores were cor- 
related for the two judges for each S over the four in- 
terviews, because it was changes within Ss with which 
the study was concerned. Thus, for judges A and B, 
there would be a correlation based on four pairs of 
scores for each of the five Ss for each measure. These 
correlations were averaged, using Fisher’s z transforma- 
tion; the averages ranged from .80 to .94. 


RESULTS 
Curves for the average Meaning Unit ratings 
and intra-interview change in Meaning Unit 
ratings as a function of the four rated inter- 
views are shown in Fig. 1. These curves depict 
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Fic. 1. Meaning unit ratings as a function of suc 
cessive interviews. 
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TABLE 1 
MEANS AND SIGMAS FOR THE PRE- AND PosT- 
ANXIETY SCORES ON THE FEELING 


INVENTORY 
Preexperiment Postexperiment 
Group a 

Mean Sigma Mean Sigma 
Regular 19.6 5.4 14.1 4.2 
Nonverbal 16.3 tua 11.2 6.1 
Tape | 16.6 5.9 14.3 3.8 
Control 16.7 3.3 17.6 3.4 


the Regular group as showing the greatest in- 
crease in both average Meaning Unit ratings 
and in intra-interview change in Meaning Unit 
ratings as the interviews progressed. The Tape 
group showed a slight tendency to decrease in 
both these measures, and the Nonverbal group 
revealed an intermediate amount of change. 

The results for the Approach and Avoidance 
ratings showed similar between-group trends, 
but because statistical analysis showed no sig- 
nificant between-group effects, the curves for 
these variables are not presented. 

For purposes of statistical analysis of the 
trends shown in Fig. 1, the ratings for inter- 
views one and two and those for interviews 
four and five were averaged. Differences be- 
tween these two sets of averages were then 
computed, and an analysis of variance was 
performed to see if the treatment groups dif- 
fered significantly in their average change 
scores. An orthogonal polynomial trend test 
was not performed on these data because the 
assumption of equal intervals was not met, 
interview three not having been rated. The 
between-group changes for the average Mean- 
ing Unit ratings did not reach statistical sig- 
nificance. The between-group changes in intra- 
interview change in Meaning Unit ratings, 
however, were significantly different (p < 
.05). A Multiple Range Test (Duncan, 1955) 
indicated that the Regular and Tape groups 
were significantly different from each other, 
but neither differed significantly from the non- 
verbal group. Thus, if one considers the change 
from the first half to the last half of an inter- 
view in approach to therapeutically significant 
material, one finds that the Regular group 
showed a progressively increasing tendency to 
approach further than did the Tape Group. 

The means and standard deviations for the 


pre- and postexperiment results on the Feeling 
Inventory are shown in Table 1. Although the 
initial mean score for the Regular group is 
higher than the means for the other groups, it 
was not significantly so as tested by the Multi- 
ple Range Test. These Feeling Inventory scores 
were converted into pre- and postexperiment 
difference scores, and the Multiple Range Test 
was then performed. The Nonverbal and Regu- 
lar groups both showed significantly greater 
decreases in reported anxiety than did the 
Control group, whereas the Tape group was 
not significantly different from the Control 
group on this score. 

The skin conductance recordings were scored 
both in terms of general level of conductance 
and in terms of reactivity. Median log conduct- 
ance scores were obtained for all five interviews 
for each S. A change in log conductance score 
was also obtained for each interview by sub- 
tracting the median log conductance score for 
the first half of the interview, from the last 
half of the interview. A reactivity score was 
obtained for each interview by marking off 
four equally spaced 4-min. intervals in each 
interview, obtaining the average of the three 
largest GSR deflections within each log con- 
ductance units, and then averaging these four 
values for an over-all estimate of reactivity. 
A change in reactivity score was obtained by 
subtracting the average reactivity score in the 
first 4-min. period from that in the last 4-min. 
period. Trends of these four scores as a func- 
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Fic. 2. Intra-interview change in log conductance 
as a function of successive interviews. 
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tion of the five sessions were tested for statis- 
tical significance by orthogonal polynomial 
trend tests (Grant, 1956). A significant be- 
tween-groups trend was found only for the 
intra-interview change in conductance scores, 
where a significant between-groups quadratic 
component emerged. Inspection of the curves 
in Fig. 2 shows that the Regular group showed 
an initial rise and then a fall in the variable, 
whereas the Tape group showed an initial fall 
and then a sharp rise in this variable, with the 
Nonverbal group lying in between. 


DISCUSSION 


It is perhaps not altogether appropriate to 
refer to these experimental sessions, even those 
of the Regular group, as psychotherapy. The 
conditions and Ss were rather special and not 
necessarily representative of psychotherapy or 
psychotherapy clients. The more restricted 
problem of studying factors that affect the 
extent to which a person openly discusses 
emotionally important aspects of his life is, 
however, certainly one that has direct impli- 
cations for the more complex therapeutic 
proc ess. 

The intra-interview change in Meaning 
Unit ratings showed significant between-group 
changes from the first two to the last two 
interviews. Stated otherwise, the Regular 
group showed a marked increase in approach 
to emotionally important material from the 
first to the last half of an interview as the five 
interviews progressed, whereas the Tape 
group, although rather erratic, showed a slight 
tendency to decrease on this measure over the 
interviews. Perhaps these results reflect the 
reinforcing effect of an attentive, responding 
therapist who communicates not only positive 
reinforcement for approach behavior, but also 
by his accepting, noncritical attitude aids in 
the extinction of the negative affect associated 
with the approach behavior. For this short 
series of experimental interviews, this “learn- 
ing” appears more pronounced within inter- 
views than when between-interview means are 
compared. Since it is not possible to tell from 
the difference scores whether changes are due 
to increases or decreases in the first or last 
halves of the interviews, the means for the 
first and last halves were plotted separately as 
a function of the successive sessions, and it was 
quite apparent that the between-group trends 


resulted almost entirely from changes in the 
last half of the interviews. In other words, the 
increasing tendency for the Regular group to 
approach further within interviews was not 
due to these Ss starting off at a lower level of 
approach in the succeeding interviews, but 
from their obtaining increasingly higher ap- 
proach ratings in the last half of the inter- 
views. 

The Feeling Inventory results indicate that 
the Regular and Nonverbal groups were in- 
clined to report themselves as having experi- 
enced a larger decrease in anxiety after the 
five sessions than the Control group. An obvi- 
ous possibility is that this self-report is not 
tapping “‘real’”’ changes in anxiety level so 
much as S’s belief, that he should be less 
anxious, or that the experimenters expect him 
to be less anxious. On the other hand, the con- 
sistency of these results with the change in 
Meaning Unit rating scores and the GSR 
data suggest that they may be due to more 
than superficial changes in verbal report. 

The only significant GSR finding was the 
between-groups quadratic component for the 
intra-interview change in skin conductance, 
which is illustrated in Fig. 2. This finding 
suggests that the Tape group showed an in- 
creasing tendency to become more anxious 
within interviews as the five sessions pro- 
gressed, with this intra-interview increase in 
anxiety being especially marked in the last 
two sessions. The Regular group, on the other 
hand, showed increasing  intra-interview 
changes in anxiety through the third session 
and thereafter showed progressively less of an 
increase in anxiety within interviews. 

If this interpretation of the GSR data in 
terms of the hypothetical anxiety state of the 
individual is correct, it raises some interesting 
questions when one compares the GSR results 
with the Meaning Unit ratings of the verbal 
behavior. First, it should be noted that for 
both kinds of data it is the intra-interview 
changes that are significant, not the mean 
scores for the interviews. Second, there appears 
to be in part an inverse relationship between 
the GSR data and the Meaning Unit ratings. 
Thus, in the fourth and fifth sessions, the 
Regular group showed a decreasing amount of 
intra-interview change in anxiety at the same 
time that this group was showing the greatest 
amount of intra-interview increase in approach 
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VERBAL AND GSR Responses TO ‘““THERAPIST’? COMMUNICATION 


to emotionally important material. The Tape 
group, on the other hand, in these last two 
sessions showed a striking increase in intra- 
interview change in anxiety, at the same time 
that it was showing a decrease in the tendency 
to approach emotionally significant material. 
At first glance, observations 
that approaching emotionally important mate- 
rial is less anxiety arousing than avoiding it. 
But the situation is probably much more com- 
plex than that. 

Ss in the Regular group, as they began to 
verbalize emotionally important content, did 
indeed show increasing degrees of intra-inter- 


these suggest 


view increase in anxiety. These results suggest 
that the Tape Ss were not completely with- 
drawing from the psychological field, but were 
experiencing concerns about which they were 
progressively more reluctant to talk. This sup- 
pressing behavior was apparently accompanied 
by a great deal of anxiety. If avoidance be- 
havior on the part of the Tape group Ss came 
to be greater increases in 
anxiety, then one would expect a negative cor- 
relation between intra-interview changes in 
Meaning Unit ratings and skin conductance 
scores for these Ss in the later interviews. The 
correlations between these two variables for 
interviews 4 and 5 were —.47 and —.31 re- 
spectively, which for an NV of 9 are not signifi- 
cant but are in the expected direction. 

This interpretation, as well as the empirical 
findings are consistent with evidence from a 
different study. Gordon, Martin, and Lundy 
(1959) induced conflict areas hypnotically, and 
then by posthypnotic suggestion caused the 


associated with 


Ss in one group in a subsequent interview first 
to “repress,” then to suppress, and then to 
verbalize overtly the conflict material. In a 
second group, order was partly controlled by 
having them first 
verbalize, and then suppress. When skin con- 
ductance for the suppressing condition was 


“repress,” then overtly 


compared with skin conductance for the ver- 
balizing condition for the combined groups, it 
was found that Ss showed a greater increase in 
conductance when suppressing than when 
overtly verbalizing. Both studies, then, support 
the hypothesis that approaching and verbaliz- 
ing conflict areas is associated with less of an 
increase in anxiety than when overt discussion 
of troubling material is avoided. 

More light may be shed on this relationship 
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if a more refined analysis of the data were 
carried out. There is considerable evidence that 
the GSR is positively related to momentary 
changes in the therapy behavior of patients 
(Dittes, 1957; Panek & Martin, 1959), and it 
is doubtful if the present findings should be 
interpreted as indicating a negative relation- 
ship between momentary changes in approach 
behavior and GSR measures. It may be that 
there is some initial rise in conductance or 
reactivity as S first approaches a new therapy 
theme, followed then by an overshadowing 


dex rease. 
SUMMARY 


From a population of college students who 
scored high on the Forced-Choice Manifest 
Anxiety Scale and who also volunteered for a 
brief psychotherapy experience, 27 Ss were 
selected on the basis of an initial screening 
interview and assigned to three treatment 
groups. One group talked to a tape recorder 
with no therapist present; a second group ex- 
perienced a therapist who responded by non- 
verbal means only, and a third group received 
regular psychotherapy. 

Ss were seen for five sessions with continuous 
tape and GSR recordings being obtained. A 
self-report index of anxiety, the Feeling Inven- 
tory, was also obtained before and after the 
five sessions. The verbal content of the inter- 
views was rated in terms of degree of approach- 
ing or avoiding emotionally important areas. 

The major significant findings were that the 
Regular group showed a tendency to increase 
in their intra-interview approach to emotion- 
ally important material as the five interviews 
progressed, whereas the Tape group showed 
the opposite trend, with the Nonverbal group 
lying in between. The GSR data indicated 
that the Regular group showed concomitant 
increases in anxiety as they approached emo- 
tionally important material in the first three 
sessions, but in the last two sessions their even 
further approach behavior was accompanied 
by less and less of an increase in anxiety. The 
Tape group, on the other hand, while showing 
a decreasing amount of intra-interview ap- 
proach behavior as the sessions progressed, 
showed a marked increase in intra-interview 
change in anxiety as the sessions progressed. 
The Nonverbal group showed intermediate 
trends. The Feeling Inventory results indi- 
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cated that the Regular and Nonverbal groups 
reported a greater decrease in reported anxiety 
than a Control group, but the Tape group did 
not differ significantly from the Control group 
on this variable. 
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EFFECT OF INTERPERSONAL RELATIONSHIPS UPON 
VERBAL CONDITIONING: 


ALLAN SAPOLSKY 
Adel phi College 


ECENT findings on the conditioning of 
human that 
verbal reinforcement by an experi- 

menter can modify Ss’ verbal responses in a 
predictable fashion and without Ss’ awareness. 
These findings have been interpreted by Kras- 
ner (1958) as having a direct bearing upon the 
process of psychotherapy. 

However, currently available conditioning 
experiments have not controlled for the possi- 
bility that a relationship between the personal 
characteristics of S and E could affect the con- 
ditioning process. Such a possibility is inherent 
in the fact that, in a therapy setting, both 
positive and negative transference relationships 
are frequently found between patient and 
therapist and are believed to affect the out- 
come of the psychotherapeutic procedure. 

The transference relationship can be viewed 
as one in which S overtly likes (or dislikes) E£, 
or as one in which S is responding to personal- 
ity characteristics which may (or may not) be 
possessed in common by E and S. The positive 
or negative quality of such interpersonal rela- 
tionships could, in turn, be expected to have 
related effects upon the responses of Ss in a 
verbal conditioning situation. 

Work by Back (1951) indicated that instruc- 
tions which orient Ss to like or dislike each 
other can alter the judged attractiveness of an 
individual. In turn, groups composed of 
‘‘attracted” individuals make more effort to 
influence each other and are more susceptible 
to influence. Generalizing this finding to the 
verbal conditioning process, it can be hypoth- 
esized that E exerts more influence (stronger 
conditioning) on an “attracted” group than 
on a “nonattracted” one. The first experiment 
of the present study was performed to test this 
hypothesis. 


subjects have shown 


1 This paper is based on a dissertation submitted to 
the faculty of Adelphi College in partial fulfillment of 
the requirement for the degree of doctor of philosophy. 
The writer is deeply indebted to Marvin A. Iverson, 
under whose supervision the study was conducted; 
Dorothy Cantor, for serving as E in Experiment I; 
and Mary E. Reuder, for invaluable assistance in the 
preparation of this article. 
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In the actual therapeutic situation, however, 
the “attractiveness” (or ‘‘unattractiveness”’) 
of E for S is not customarily established on the 
basis of overtly available information. Rather, 
it appears to develop within the therapy setting 
as the result of an interaction of the personality 
characteristics of S and E. The work of 
Murphy (1947) and Schutz (1958) suggests 
that people are attracted to individuals whose 
personality characteristics they perceive as 
empathic to their own (compatible character- 
istics). Likewise, they find unattractive those 
individuals whose personality characteristics 
are perceived as incompatible with their own. 
If “attractiveness,” based on the perception 
of subtle cues stemming from related personal- 
ity characteristics of S and E, is a factor in the 
verbal conditioning situations, then “com- 
patible” Es should produce stronger condition- 
ing than “incompatible” Es. This was the 
hypothesis which the second experiment in 
this study was designed to test. 

The work of Festinger (1957) on compliant 
behavior suggested that the physical presence 
of E might serve to reinforce the effects of 
whatever personality characteristics were 
operating in the acquisition phase of condition- 
ing, and that such affects might be modified in 
E’s absence. For this reason, both experiments 
were extended to include an extinction series 
with E absent—a condition which parallels 
that of the patient upon leaving the therapy 
setting. 


EXPERIMENT I 
Method 


Subjects. Ss were 30 female college freshmen repre- 
senting a homogeneous group of education majors just 
graduated from high school and living as a group on 
campus. Ss were randomly assigned to receive instruc- 
tions designating E as either attractive cr unattractive. 
Fifteen Ss received High Attractiveness instructions, 
the other 15 Low Attractiveness instructions. 

Materials. A total of 160, white, 3 X 5, unlined index 
cards were used as the stimuli. In the center of each 
card, a different commonly used verb was typed in the 
past tense. Six personal pronouns, “I,” “We,” “He,” 
“She,” “They,” and “You” were typed below each 
verb, and their respective orders were randomized for 
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the entire series of 160 cards. The stimulus cards were 
shuffled after each administration in order to randomize 
the order of presentation for each S 

Experimenter. E was a female graduate student 
whose physical appearance was such that she could be 
easily mistaken for an undergraduate student. She was 
iressed in a manner appropriate for undergraduate 
students on the carpus. The same £ was used for all 
Ss 

Procedure. Just prior to her introduction to E, each 
S was informed by the investigator that this was an 
experiment which required the two persons who would 
be working together to be congenial. S was further told, 
“You will be paired with an experimenter whom you 
will find congenial. We have developed a questionnaire 
which enables us to do this.” 

The FIRO-B, a scale requiring ratings of behavior 
which one desires to find in social situations, was then 
administered and scored in S’s presence 

In addition, S was asked, “Can you describe the 
person you want to work with? What would be the most 
objectionable traits in a person with whom you would 
work?” S was then randomly assigned to either one of 
the experimental conditions 

Under the condition of High-Attraction, S was told, 
“Usually, we can’t match people exactly, but in your 
case this will be possible.” Ss in the Low-Attraction 
group were told, “Usually, we can match people quite 
well, but in your case we’re going to have some trouble 
It’s going to take too long to locate somebody for you, 
so I’m assigning you to Miss C. She may irritate you 
a little, but do the best you can.’ 

After greeting and seating S, E£ read the following 
instructions which were adapted from those of Taffel 
(1955) 

You will see a word in the center of each card. I want 

you to make up a sentence using this word. Below 
the word in the center, you will see a group of other 
Take any one of these and use it to start your 
sentence (pause). Now, it doesn’t matter whether the 
sentence you make up is long or short or even if it 
is complicated or simple. It is important that you 
answer with the first sentence that enters your mind 
It isn’t easy to do this, but you will find that, if you 
try to answer as quickly as possible, you are more 
likely to give the first thing that comes into your 
Everything we say is being recorded 
on a tape recorder, so that it 


words 


mind (pause) 
(point to microphone 
will not be necessary for me to write down your sen 
tences now, but I will have a record of all the sen 
tences you make up (pause). Do you understand the 

directions? All right, let’s begin 

E turned each card after S’s 
obtain a criterion level for the use of the critical pro 
or “We”, the responses of S to the first 20 
cards were not reinforced 
20 cards, E provided reinforcement by saying “mmm 
hmm” in a flat, unemotional tone at the end of any 
sentence which began with the “I’’ or “We” pronouns 
When S had responded to the first 80 stimulus cards, 
the investigator, from behind a one-way vision screen 
simulated a telephone ring in the experimental room 
following manner, 
all right.” 


response In order to 


nouns “I 
For the next three series of 


phone in the 
I’m busy right now 


E answered the 
“Hello yes but 
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She then explained to S that she would have to leave, 
but that S should continue making up sentences until 
she returned. The extinction phase consisted of four 
series of 20 cards each, which S administered to her- 
self while E was out of the room. 

At the conclusion of the experiment, S was inter- 
viewed by the investigator. A series of questions, 
adapted from Tatz (1956), was asked to determine S’s 
degree of awareness of the conditioning procedure. Tatz 
had found a relationship between degree of awareness 
and rate of conditioning. Although, in the present study, 
all Ss were aware of E’s use of ““mmm-hmm,” only three 
reported an association of this behavior with possible 
reinforcement effects. Two of these Ss were from the 
Low-Attraction and one from the High-Attraction 
group. 

As a check on the effectiveness of the instructions 
variable, Ss were also asked to rate E on a self-anchored 
sociometric scale devised by Gardner and Thompson 
(1956). Of 15 Ss receiving High-Attraction instruc- 
tions, 11 rated £ in the predicted direction, and 12 of 
those receiving Low-Attraction instructions responded 
in the manner predicted. A probability test indicated 
significance beyond the .01 level, verifying the effec- 
tiveness of the directions in establishing tne two experi- 
mental groups. 


Results 

The curves of acquisition and extinction for 
both groups are plotted in Fig. 1. The acguisi- 
tion data were then separately treated by 
analysis of variance according to Mixed 
Design I of Lindquist (1956). As can be seen 
in Table 1, the interaction between trials and 
attraction is significant beyond the .01 level. 
The nature of this interaction was further 
examined by applying a separate trend test to 
the slope of the curve of each attraction group. 
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TABLE 1 
SUMMARY OF ANALYSIS OF VARIANCE FOR THE 
Two ConpDiITIONS OF ATTRACTION DvuRING 
THE ACQUISITION PERIOD 


Sum of | Mean 


Source df Squares | Square F ratic 

Total Between-Ss 29 (1125.5 
Attraction 1 | 154.1 |154.1 | 4.43" 
Error Between 28 | 971.4 | 34.7 
Total Within-Ss 90 | 831.0 
Trials 3 | 195.3 | 65.1 |11.03** 
Interaction: T X A 3 | 140.8 | 46.9 | 7.93** 
Error Within 84 | 494.9 5.9 
Total 119 |1956.5 

*p < .05 

"p< 01 

TABLE 2 


SUMMARY OF ANALYSIS OF VARIANCE FOR THE 
Two CONDITIONS OF ATTRACTION DURING 
THE EXTINCTION PERIOD 


: Sum of | Mean . 
Source df Squares denese F ratio 
Total Between-Ss 29 =|1847.9 
Attraction 1 29.0 | 29.0 |}<1.00 
Error Between 28 (1818.9 | 64.9 
Total Within-Ss 120 (1017.2 
Trials 4 26.5 6.6 |<1.00 
Interaction: T X A 4 218.0 | 54.5 7.90** 
Error Within 112 772.7 6.9 
Total 149 (2865.7 
*~ < .05 
es < 


The F ratio thus obtained for the High-Attrac- 
tion group was significant beyond the .O1 
level. The Low-Attraction group did not reach 
the conventional level of significance. 

The obtained interaction is therefore due 
to the fact that the High-Attraction group 
showed a continuously rising curve of acquisi- 
tion of the CR, whereas the Low-Attraction 
group showed relatively little acquisition (see 
(Fig. 1). This finding confirmed the hypothesis 
that Ss who find E attractive use the rein- 
forced pronouns more frequently than Ss who 
find Z unattractive. 

The extinction data were treated by analysis 
of variance, following the same design. Again, 
the interaction between trials and attraction 
was significant beyond the .01 level (see Table 
2). However, examination of the extinction 
curves of the High-and Low-Attraction groups 
(Fig. 1) indicated that this interaction was not 


the same as that found for the acquisition 
curves. Trend tests of the slopes of the two 
extinction curves confirmed this difference. 
The F ratio for the slope of the High-Attrac- 
tion group curve showed no significant depar- 
ture from the horizontal base line, but the F 
ratio for the slope of the curve of Low-Attrac- 
tion Ss was significant beyond the .01 level. 
Thus, the Low-Attraction group may be said 
to have shown the effects of conditioning dur- 
ing the “extinction” period rather than during 
the “acquisition” phase of the experiment. 

The “extinction” data of the High-Attrac- 
tion group were further analyzed to determine 
whether experimental extinction actually 
occurred. The means of each of the extinction 
trials were separately compared with the mean 
of Trial 4 (the final acquisition trial). No sig- 
nificant decrease from the final acquisition level 
was found. The High-Attraction group retained 
the CR level of its original conditioning 
throughout the extinction period. 


EXPERIMENT II 
Method 


Experimenters and subjects. Since the purpose of the 
second experiment was to study the effects of inter- 
personal relationships upon conditioning, it was neces- 
sary to hold constant the relevant personality char- 
acteristics of E and S. This was done by selecting E 
and S on the basis of their score patterns on the FIRO-B 
Scale, which measures needs in interpersonal relation- 
ships (Schutz, 1958). A preliminary study (Sapolsky, 
1958) had shown this scale to be effective in establishing 
compatible and incompatible groupings among in- 
dividuals having only classroom contact with each 
other. 

Five female Es were selected from an undergraduate 
psychology class on the basis of their FIRO-B scores. 
These Es varied in their scores on the Inclusion and 
Affection areas, but all scored high on the Control area. 
E’s role in the conditioning procedure was essentially 
a controlling one; it was therefore deemed preferable 
to choose persons who desired to control interpersonal 
relationships. 

Ss for each E were then obtained by screening a 
variety of undergraduate classes in which E was en- 
rolled. Approximately 300 female students were given 
the FIRO-B, from whom 30 were selected as suitable 
Ss for this experiment. S was selected only if both she 
and her respective E were compatible (or incompatible) 
on all three of the interpersonal need areas measured 
by the FIRO-B(the method of establishing compati- 
bility of need area has been derived by Schutz [1958}). 
Each E was assigned to 3 Compatible and 3 Incom- 
patible Ss with whom she worked individually. This 
procedure resulted in two groups of 15 Ss each—one 
being highly compatible (Compatible Group) and one 
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being highly incompatible (Incompatible Group) with 
its Es 

Since the Ss were drawn from a relatively hetero- 
geneous population, additional control data were gath 
ered. No significant differences between the two groups 
of Ss were found in age, intelligence, extent of acquaint 
ance with E, or Taylor Manifest Anxiety Scale score 

Procedure. The procedure was identical to that in 
Experiment I, except that the “attractiveness” in 
structions were omitted. The awareness questionnaire 
and the self-anchored sociometric scale were again used 
with similar results, as controls on the effectiveness of 
xther procedures 

All but five Ss (2 Compatible and 3 Incompatible 
were aware of the “mmm-hmm” behavior of their E 
Six Ss (3 from each group) recognized the reinforcing 
effect of E’s responses. There was no systematic rela- 
tionship between awareness of conditioning and the 
Compatibility-Incompatibility variable. 

It had been predicted that Compatible Ss and Es 
would like each other and that Incompatible Ss and 
Es would not. On the sociometric scale, 22 of the 30 
Ss rated and were rated by their Z’s in the predicted 
direction. Of the 30 pairs created by combining Es and 
Ss, 19 rated each other as anticipated. A probability 
test showed these results to be significant beyond the 
001 level. This finding established a similarity between 
the relationships studied in Experiment I and Experi 
ment II despite the differences in operations employed 


Results 

The data for Experiment II are graphically 
illustrated in Fig. 2. It can be readily seen that 
there is high similarity between these curves 
and those found in Experiment I (Fig. 1). 

Separate analyses of variance of the acquisi- 
tion and extinction data were carried out ac- 
cording to Lindquist’s (1956) Mixed Design 


Ii! 














rie.2 
d 14 ACQUISITION EXTINCTION 
4 
Ko 134 
SB 24 COMPATIBLE 
~ i! 4 
on 
«Ss 107 
B2 9- 
29 
ze °) 
z 77 
3 6] 
5i 
t ¥ cs 7 7 7 
! 2 3 a 5 6 7 8 
TRIALS 


Fic. 2. Curves showing mean number of reinforced 
pronouns used by the compatible and incompatible 
groups 


TABLE 3 
SUMMARY OF ANALYSIS OF VARIANCE FOR THE 
Two CompatTrsitity GROUPS DURING 
THE ACQUISITION PrRIOD 


Sum of | Mean 


Source dj Squares | Square F ratio 

Total Between-Ss 29 574.9 
Compatibility 1 83.4 | 83.4) 4.44* 
Experimenters 4 46.2 | 11.6 |<1.00 
Interaction: C X E 4 69.3 | 17.3 |}<1.00 
Error Between 20 376.0 | 18.8 
Total Within-Ss 90 725.0 
Trials 3 152.7 | 50.9 | 11.84** 
Interaction: T X C 3 161.4 | 53.8 | 12.51** 
Interaction: T X E 12 78.7 6.6 1.53 
Interaction: TX CX E) 12 77.5 6.5 1.51 
Error Within 60 254.7 4.3 
Total 119 (1299.9 

*p < 0S 

"p< Ol 

TABLE 4 


SUMMARY OF ANALYSIS OF VARIANCE FOR THE 
Two COMPATIBILITY GROUPS DURING 
THE EXTINCTION PERIOD 


Source df Fant } F ratio 

Total Between-Ss 29 +|1423.0 
Compatibility 1 119.8 |119.8 | 2.36 
Experimenters + 75.1 | 18.8 |<1.00 
Interaction: C X E 4 211.6 | 52.9 1.04 
Error Between 20 (1016.5 | 50.8 
Total Within-Ss 120 1034.0 
Trials 4 81.9 | 20.5 2.85* 
Interaction: T X C 4 132.2 | 33.1 4.59** 
Interaction: T X E 16 44.7 2.8 |<1.00 
Interaction:T XCXE) 16 197.7 | 12.4 1.72 
Error Within SO 577.5 7.2 
Total 149 (2457.0 

"p< .05 

** > « | 


The acquisition data (Table 3) show the 
same kind of interaction (significant at the .01 
level) as was found in Experiment I. Trend 
tests indicated that the Compatible Group had 
a significantly rising curve of acquisition, 
whereas the Incompatible Group (with one 
exception) did not. In the latter group, there 
was a significant rise on the first reinforced 
trial which was not sustained in later trials. 
These findings confirm the hypothesis that 
Compatible groups increase their use of rein- 
forced pronouns more than Incompatible 
Groups. They further argue that similar effects 
on the acquisition of a conditioned verbal re- 
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sponse occur when personal attractiveness is 
based on overt instructions as when attraction 
results from compatibility in interpersonal 
needs. 

The extinction data in Experiment II also 
confirm the results of Experiment I. The inter- 
action between trials and compatibility 
(Table 4) is significant beyond the .01 level. 
As in Experiment I, trend tests indicate 
that the increase in response of Incompatible 
Ss accounts for the interaction. Compatible Ss 
showed no appreciable over-all change over 
the acquisition phase. Thus, Incompatible Ss 
in this experiment responded in the same man- 
ner as did Unattracted Ss in Experiment I. 
They showed the effects of conditioning during 
the extinction period. 

The extinction data of Compatible Ss in 
Experiment II showed one major difference 
from that of Attracted Ss in Experiment I. 
When the means of the extinction trials for 
Compatible Ss were separately compared to 
the mean of their Trial 4 (maximum acquisition 
trial) a decrease, significant at the .05 level, 
was found for Trials 6 and 7. Nevertheless, 
the total amount of conditioning was still 
significantly above the initial criterion level. 

DISCUSSION 

The results of the two experiments support 
the major hypothesis that the positive or nega- 
tive qualities of the interpersonal relationship 
between S and £ have related effects upon S’s 
performance in a verbal conditioning situation. 
When there was either High-Attraction or 
Compatibility between S and E, the reinforcing 
value of “mm-hmm” was enhanced, presum- 
ably by the additional reinforcing qualities 
that E represented, and a typical learning 
curve was obtained during acquisition. When 
the Attraction between S and £ was low (or 
S and E were Incompatible), no significant in- 
crease in the use of the reinforced pronouns 
occurred during the acquisition period of either 
experiment. The personal quality of E can here 
be viewed as having taken on the aspects of an 
aversive stimulus which resulted in a decre- 
ment in the observed strength of the condi- 
tioned response. Data from the extinction 
phase of the experiments also supported this 
interpretation. With the removal of the aver- 
sive stimulus (£), the effective quality of the 
‘“‘mmm-hmm” reinforcement was displayed in 


the form of a significant increase in the previ- 
ously rewarded behavior. Thus, the effect of a 
Nonattractive or Incompatible Z was to sup- 
press or counteract the immediate effectiveness 
of the positive reinforcement. This would sug- 
gest that when results derived from condition- 
ing studies are generalized to areas involving 
a relationship between two persons, the quality 
of that interpersonal relationship must be con- 
sidered. 

These results extend and qualify the finding 
of Back (1951), that in more highly attracted 
groups, the members are more receptive to 
influence. An important factor in Back’s study 
was that Ss could terminate face-to-face con- 
tact with each other at their own discretion, 
and thus did not feel forced to remain in the 
influencing situation. Ss in the present study 
were required to remain in the influencing 
situation. The obtained results support Back’s 
finding that less attracted Ss are less willing to 
receive influence. However, it was also found 
that when Ss were required to expose them- 
selves to influence, both groups ultimately 
showed similar susceptibility to the effects of 
the influence, although these effects were de- 
layed in the resistive group. 

This finding suggests that Back’s concept of 
“receptiveness to influence” needs to be refined 
to point up two independent possibilities—not 
only a willingness on the part of S to receive 
influence, but also S’s susceptibility to such 
influence once exposed. The results of the ex- 
tinction phases indicate that individuals may 
actually become influenced but are unable or 
unwilling to display the effects of this influence 
within the social setting. Thus, Ss who fail to 
give anticipated responses may actually be 
suppressing the effects of the major treatment 
until they leave the influential setting. 

Two differences between the results of 
Experiments I and II should be noted. First, 
the initial significant rise in the curve for 
Incompatible Ss did not occur in the curve for 
the Low-Attraction group; and second, the 
curve for the Low-Attraction group of Ss rose 
sharply at the beginning of the extinction 
period, whereas a comparable rise for Ss of the 
Incompatible group was delayed until the 
latter half of the extinction trials. These dis- 
crepancies probably resulted from the nature 
of the experimental procedures. The instruc- 
tions to Low-Attraction Ss immediately 
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focused their perceptions on a dislike of E; 
the emotional situation was explicitly labeled 
for them. There was no such direct focusing of 
attention for Incompatible Ss. Instead, their 
attitudes toward their Zs were necessarily of 
an intrinsic nature and therefore could not be 
as quickly and as overtly evident to them. It 
had to develop as a result of social interaction 
between E and S. 

Krasner (1958) has pointed out the similarity 
between the verbal reinforcement by £ in 
conditioning procedures and the therapist’s use 
of “‘mmm-hmm”’ in the psychotherapy setting. 
The findings of the present study suggest that 
the subtle cues provided by a therapist’s use 
of ‘“‘mmm-hmm”’ are likely to be effective only 
when the interpersonal relationship between 
him and the patient is positive or compatible. 
The results further suggest that, for resistive 
or negativistic patients, an accurate evaluation 
of psychotherapeutic effectiveness may not be 
possible at the time therapy is actually taking 
place. It may well be that for such patients, 
the influence experienced during treatment 
will be expressed only after termination of the 
therapeutic relationship. 


SUMMARY 


An investigation was designed to study the 
effects of interpersonal relationships upon ver- 
bal conditioning. Two conditioning experi- 
ments were conducted in which & used ‘““mmm- 
hmm” to reinforce sentences which S began 
with pronouns “I” or “We’’. Experiment | 
relied upon instructions to create a condition 
of either High- or Low-Attraction between S 


and E. Experiment II utilized the FIRO-B 
scale to establish Compatible and Incompatible 
groups in terms of interpersonal needs. E was 
present during the acquisition phase and ab- 
sent throughout the extinction period. 

Ss who were Incompatible, or not attracted 
to E, suppressed the effects of reinforcement 
until removed from E’s presence. The implica- 
tions of these findings for generalizing from 
verbal conditioning situations to the patient- 
therapist relationship in clinical settings was 
discussed. 
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OPERANT CONDITIONING IN A QUASI-THERAPY SETTING! 


. J. MAURICE ROGERS 


Stanford University 


ECENT studies of verbal conditioning, 
although not designed as investiga- 
tions of the process of psychotherapy, 

have important implications for the under- 
standing of interactions between the therapist 
and the patient. Conducted along the lines of 
Skinner’s operant conditioning paradigm with 
rats and stimulated by Greenspoon’s pioneer- 
ing study (1955), subsequent experimenters 
have amply demonstrated that human verbal 
behavior can be operantly conditioned, that 
such conditioning can be accomplished without 
awareness, that a large variety of verbal be- 
haviors may constitute Skinnerian “responses” 
and an equally large variety of environmental 
events serve as “reinforcing stimuli,” and that 
typical conditioning phenomena (positive and 
negative reinforcement, discrimination, extinc- 
tion, etc.) analogous to that found in animal 
experimentation can be produced. 

Psychotherapists, whatever their theoretical 
affiliation, agree that psychotherapy implies 
effort directed toward the production of per- 
sonality changes and is successful to the extent 
that specified changes occur. Although changes 
have been claimed in a variety of classes of 
behavior, such as anxiety, aggression, and de- 
pendency, the bulk of such claims have been 
unsupported by objective data. Among the 
more systematic studies are those of Rogers 
1951) and his associates (Rogers & Dymond, 
1954), who have repeatedly shown that an 
important change that occurs with psycho- 
therapy is modification of the patient’s self- 
concept. Raimy (1948), who was the first to 
demonstrate a decrease in negative and an 
increase in positive self-references in successful 
therapy, construed this as evidence of a reor- 
ganization of the self, and this view has since 
been strongly advanced by the Rogerian 
school. Reinforcement theory, however, makes 
tenable an alternative explanation—that the 

! This report is based on the author’s dissertation 
for the degree of doctor of philosophy, completed at 
Stanford University June 1958 under a committee con 
sisting of Albert Bandura (Chairman), C. Leland 
Winder, and Paul R. Farnsworth, whose counsel is 
gratefully acknowledged 
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reported changes in self-reference verbaliza- 
tions are elicited via unintentional selective 
reinforcement by the Rogerian therapist. 
Research in operant conditioning of verbal 
behavior is gradually approaching the natural 
clinical setting of psychotherapy itself. In 
order for experimental findings to have maxi- 
mal relevance for both the process and out- 
come of psychotherapy, it is essential that 
conditioning investigations be conducted in 
situations paralleling as closely as possible 
those of psychotherapy itself, yet retaining the 
controls and rigor essential to objective scien- 
tific study. Under such conditions, research 
findings will be more directly applicable to 
psychotherapy as it is characteristically con- 
ducted without the necessity of translating 
them into other terms and another frame of 
reference with the consequent hazard of over- 
extrapolation. Manipulation of actual therapy 
itself, however, is limited by the clinician’s 
ethical responsibility toward the paiient, 
which compels him to exercise restraint in 
introducing conditions which conceivably 
might produce adverse effects in the patient. 
It was with these considerations in mind that 
the present experiment was designed and 
carried out. The hypotheses tested were, that 
an interviewer can produce changes in a sub- 
ject’s self-reference verbalizations by con- 


sistently reinforcing a particular class of such 
statements with simple stimuli, and that such 
reinforcements can alter a subject’s concept 
of himself, as measured by personality tests. 


METHOD 


The experiment consisted of setting up a quasi- 
therapy situation involving a series of free-interviews 
with individual Ss in which all self-reference verba'iza 
tions were reinforced by the interviewer with a simul- 
taneous ““_Mm-hm” and nod of the head. Each S was 
told that the interviewer was making a study to learn 
how people think and feel about themselves and was 
asked to describe spontaneously his personality char- 
acteristics and traits without questions or comments 
from the interviewer. Every interview was tape re- 
corded so that content analyses could subsequently be 
done directly from the tapes. 

Ss were 36 male students drawn from the introduc- 
tory psychology course at Stanford University. They 








were randomly assigned to two experimental groups 
and a control group—Group A, reinforced for positive 
self-references; Group B, reinforced for megative self- 
references; and Group C, not reinforced at all. A total 
of 216 ten-minute “therapy” interviews (6 per S) were 
tape recorded, and a detailed content analysis of all 
self-references was made to ascertain if any relation- 
ship existed between self-references*? and _ reinforce- 
ments 

A battery of four personality tests was administered 
prior and subsequent to the therapy to see if changes 
would occur, and whether such changes would be related 
to self-reference changes in therapy. Two of these tests, 
Adjective Self-Descriptions and Sentence Completions, 
were designed by the experimenter to induce the ap- 
propriate “set” for the ensuing therapy interviews by 
eliciting self-referent ideation. The other two tests 
were utilized to provide measures of personality change 
These tests were the 50-item form of the Taylor Scale 
of Manifest Anxiety (Taylor, 1953) and the Q Sort 
Emotional Adjustment Test. The latter is a universe of 
100 self-reference statements developed by Haigh and 
Butler and converted by Dymond into an adjustment 
test (Rogers & Dymond, 1954, p. 76). 

Finally, a structured interview, likewise tape re- 
corded, was conducted to ascertain Ss’ awareness of 
the conditioning and their reaction to the total experi- 
ment. All Ss were individually tested and interviewed, 
four 1-hr. sessions being conducted with each S. 

In scoring all the self-reference categories in the in- 
terviews and in the Sentence Completion Test, reli- 
abilities were established by correlating ratings of 
two independent judges. Product moment reliability 
oefficients ranged from .83 to .95. In scoring self- 
references in the Adjective Self-Description Test, coding 
was done by three independent judges, the code 
assigned each item being based on majority agreement 
of the judges. Agreement was high, there being unanim- 
ity on almost three-fourths of the items. 

RESULTS 
Effect of Reinforcement on Self-References in 

Therapy 

Table 1 shows the mean therapy interview 
scores for the three groups. The response 
classes signify positive, negative, and ambigu- 
ous self-references, other or non-self-refer- 
ences, and total number of responses. Inter- 
view 1 represents S’s operant level since the 
interviewer was silent throughout that inter- 
view. Interviews 2 through 6 were conditioned 

* “Self-reference” was operationally defined as a 
verbal response by S which describes him in some way, 
tells something about him, or refers to some affect he 
experiences. A self-reference may be positive (favor- 
able), negative (unfavorable), or ambiguous (neither). 
The unit to be coded was termed a “response” and was 
defined basically as a grammatical sentence. This was 
a synthesis of “Sentence-Unit” and “Thought-Unit” 
scoring (Dollard & Mowrer, 1947), on which they 
achieved high rater reliability 
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TABLE 1 
MEAN THERAPY INTERVIEW SCORES FOR THE 


THREE GROUPS 
Response Class P% N% A% O% Total 
Experimental Group A 
Interview 1 27.31 33.77 27.07 11.77 | 47.92 
Interviews 2-6 30.47 23.29 22.78 23.70 | 53.20 
Change 3.16 |—10.49 | —4.29 11.93 5.28 
Experimental Group B 
Interview 1 31.53 27.22 34.21 7.04 | 61.17 
Interviews 2~€ 22.01 37.47 27.19 13.33 | 61.15 
Change ~9.52 10.25 —7.02 6.30 | —.02 
Control Group ( 
Interview 1 33.17 33.07 25.37 8.40 | 54.00 
Interviews 2-6 18.93 | 29.67 | 29.86 21.53 | 54.75 
Change —14.23 | —3.39 4.49 13.14 1S 
TABLE 2 


t VaLuES OF DIFFERENCES BETWEEN THERAPY 
INTERVIEW CHANGE SCORES 


Total 
Response Class P% N& A% O% — 
Responses 
Group A vs. Group C | 3.27" | 1.86 1.76 30 1.18 
Group B vs. Group C 81 3.40°*| 2.50° | 1.71 23 
Group A vs. Group B | 3.45" | 5.24” 55 1.44 1.31 


* Significant beyond the .05 level. 
** Significant beyend the .01 level 
* Significant beyond the .005 level 
© Significant beyond the .0005 level. 


TABLE 3 
t VALUES OF DIFFERENCES BETWEEN OPERANT 
AND CONDITIONED RESPONSES 


Response Class P& N&% 
Group A 1.02 1.60 
Group B 2.90** 1.78* 
Group C 3.9" 59 


* Significant beyond the .05 level 
** Significant beyond the .01 level 


interviews for the experimental Ss in Group 
A and Group B, but were unconditioned for 
the control Ss in Group C. 

In Table 2 the changes in each group are 
compared with those in the other groups and 
the significances of those comparisons deter- 
mined by means of the / test. 

In Table 3 the operant response level 
(Interview 1) is compared with the conditioned 
response leve] (Interviews 2 through 6) in 
each group and the significance of the change 
determined by the / test. 


ses 
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When the data are examined regarding the 
outcome of reinforcing negative self-references, 
a straightforward conditioning effect is found. 
From Table 3, it is seen that Group B, rein- 
forced for negative self-references, showed a 
significant increase in that response class, 
while Groups A and C showed no significant 
change in that response. Table 2 shows that 
the change from operant to conditioned level 
in Group B was significantly different from 
the change in Groups A and C. 

When the data are scrutinized for positive 
self-references, the picture is more complex. 
As shown in Table 3, Group A (reinforced for 
positive self-references) did not increase in 
that response class, and Groups B and C each 
showed a significant decrease in the response. 
From Table 2, however, it is seen that the 
change from operant to conditioned level in 
Group A is significantly different from the 
change in Groups B and C. It is clear, there- 
fore, that the significance of the differences 
between the changes in the three groups is 
attributable to the decline in both Group B’s 
and Group C’s positive self-references rather 
than to the increase in Group A’s positive self- 
references. Without reinforcement, positive 
self-references may tend to _ extinguish, 
whereas, with reinforcement, their extinction 
is arrested. The prediction that the inter- 
viewer’s reinforcement of the S’s positive 
self-references would increase their frequency 
is thus only obliquely confirmed. 


Relationship of Conditionability to Initial Level 
of Anxiety and Emotional Adjustment 


It was predicted that conditionability 
(i.e., the readiness with which the S responded 
to reinforcement) is positively correlated with 
initial level of anxiety and negatively corre- 
lated with initial emotional adjustment. In 
other words, anxiety and poor emotional ad- 
justment should facilitate the conditioning 
process. Rank difference correlation coeffi- 
cients were computed between the Taylor 
Anxiety Score and conditionability, and be- 
tween the Q Sort Adjustment Score and con- 
ditionability. The data did not support the 
hypothesis—no significant relationship was 
found between conditionability and anxiety, 
or between conditionability and emotional 
adjustment for either experimental group. 


Effect of Therapy Conditioning on Personality 

Tests 

Reduction in anxiety and improvement in 
emotional adjustment as a result of the condi- 
tioning in therapy was predicted. Accordingly, 
pre- and posttherapy scores were compared on 
the Taylor Scale and on the Q Sort Adjustment 
Test. No significant changes were found. It 
was noted, nonetheless, that on the Q Sort 
both experimental groups showed improved 
adjustment, the change in each group, how- 
ever, not being significantly different from 
that in the control group. This suggests that 
with a longer period of conditioning—i.e., 
more or lengthier interviews—adjustment 
scores may be significantly increased. 

To determine whether the conditioning of 
self-references within therapy would generalize 
to self-references outside of therapy, pre- and 
posttherapy scores were compared on the 
Adjective Self Description Test and on the 
Sentence Completion Test. Again, no signifi- 
cant changes were found. 

Thus, it may be concluded from the data 
that the effect of the reinforcing stimuli was 
confined to the interview itself. 


Consistency of Personality in Interview and 
Tests 


By correlating the S’s responses to the four 
psychological tests with his wunconditioned 
responses in quasi therapy, i.e., in Interview 1, 
it was possible to measure the consistency of 
personality. As would be anticipated, the data 
shows considerable, though by no means 
complete, personality consistency. The well 
adjusted person, it was found, has less manifest 
anxiety and refers to himself in more favorable 
terms in the free-interview, as well as in adjec- 
tives and in sentence completions, than the 
less well-adjusted person. Similarly, the person 
who refers to himself most unfavorably in the 
free-interview talks least. These, of course, 
are hardly revelations, being common clinical 
observations by psychotherapists. It may be 
somewhat comforting to clinicians, however, 
to have objective confirmation of some of their 
views. 


Experimental Ss’ Awareness of the Conditioning 


“Awareness” was operationally defined as 
an explicit statement by S that the interviewer 
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responded with “Mm-hm”’ or with a nod when- 
ever S made a positive (or negative) self- 
reference. From the recordings of the struc- 
tured interviews, it was determined that not 
a single S was aware of the head-nod condition- 
ing; no one even mentioned it. Since Ss in the 
main tended to avoid looking at the inter- 
viewer during the therapy interviews, this is 
not surprising. As for the ‘“‘Mm-hm”’ condition- 
ing, only 2 of the 24 experimental Ss met the 
operational criterion for awareness, although 
23 of them reported that they had noticed the 
“Mm-hm.” Most of the latter interpreted the 
sound as a positive, rewarding response, sig- 
nifying one or more of the following: a sign of 
encouragement or approval from the inter- 
viewer; an indication that the interviewer was 
listening, interested in, in agreement with or 
understanding of what S was talking about; 
or a sign that S was “on the right track” in 
talking about his personality characteristics 
rather than about extraneous matters. In 
short, while denying more categorical influence 
by the interviewer and failing to connect the 
“Mm-hm” with the specific verbal response 
class which it followed, they described the 
“Mm-hm” as a kind of generalized reinforcer 
which, by encouraging or reassuring them, 
helped them to continue talking or to remain 
on the over-all topic of their personality. 

A few Ss interpreted “Mm-hm”’ idiosyn- 
cratically: one thought it meant different 
things at different times—that the interviewer 
agreed with him, wanted him to speed up, or 
wanted him to change the topic. Another 
thought that it sometimes was meant as 
agreement and encouragement to continue 
with the topic, sometimes as a sign that he 
should drop the topic, and sometimes “to 
distract me.” A third felt that it occurred 
“in disconnected places.” A fourth held that 
it was intended “to get me off balance and 
upset my thinking... you were trying to see 
if I was easily disturbed.” 

Some Ss insisted that they were entirely 
uninfluenced by the interviewer’s “Mm-hm.” 
One, affirming that the “Mm-hms”’ had had, no 
effect on him, asserted, “I was just talking 
what I felt. I didn’t care whether you agreed 
with me or not.” Ancther declared, “I think 
if you had walked out of the room and given 
me the mike and stop-watch and asked me to 


talk for ten minutes about myself, I would 
have said exactly the same thing.” 

The single S who did not report that he had 
even noticed the “Mm-hms” declared that the 
interviewer was paying no attention at all to 
what he was saying but was focusing solely on 
his bodily manifestations of emotional tension. 

It will be remembered that in the therapy 
interviews with the control group, the experi- 
menter made no response whatsoever to S’s 
verbalizations except the generalized response 
of attention. The twelve subjects in this group 
unanimously asserted that the experimenter 
had in no way influenced their verbalizations. 

In their reaction to the experimenter, how- 
ever, they fell into two subgroups: those who 
reported that they pretty much ignored his 
presence, and those who reported that they 
were sensitive to his presence. The latter felt 
the experimenter was observing them closely 
either to detect signs of “‘nervousness”’ or “‘to 
analyze” what they were saying. The former 
tended to describe themselves as so absorbed 
by their self-verbalizations that they rapidly 
became oblivious to the presence of the experi- 
menter. 

DISCUSSION 

The major finding of this experiment, that 
S’s verbalizations are influenced by an inter- 
viewer’s reinforcements, is consistent with the 
evidence already accumulating in human 
operant conditioning investigations. This 
finding has considerable relevance for psycho- 
therapy since changes in patient verbalizations 
are characteristically adduced as proof of 
personality change via psychotherapy. The 
various schools of psychotherapy generally 
attribute such changes primarily to monadic, 
intrapsychic events in the patient; ignoring, 
denying, or minimizing the immediate influ- 
ence of the therapist. Even when they do 
acknowledge it, they typically fail to specify 
how the therapist is instrumental in determin- 
ing the nature of the material elicited from the 
patient or the directions in which the patient 
changes. Since both positive and negative 
self-references were shown to be modifiable by 
interviewer reinforcement, one may ask if such 
changes signify, for example, changes in the 
Self, as the Rogerians assert, or if they are 
correlated with more general changes in the 
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patient’s motivational and attitudinal charac- 
teristics? Can one expect changes in behavior, 
verbal or otherwise, outside therapy to be 
correlated with verbal changes in therapy? 

The present experiment suggests that the 
influence of minimal reinforcements, such as 
“Mm-hm,” is restricted to the interview itself, 
for there were no correlative alterations of non- 
interview self-references or of personality test 
scores. Thus, as W:nder (1957) put it, “One 
line of thought would consider psychotherapy 
as a complex and not always efficient condi- 
tioning process in which the patient learns to 
talk differently and little else.” The present 
data may be construed as supporting such a 
generalization. On the other hand, psycho- 
therapists could argue that personality modifi- 
cation results only from intensive and pro- 
longed therapy and is not to be expected from 
a few short interviews. 

The finding that conditioning took place 
without awareness (all Ss were unaware of the 
head-nod conditioning and all but two were 
unaware of the ““Mm-hm” conditioning) con- 
firms what most verbal conditioning studies 
have previously found. It thus appears that 
changes in verbal behavior can be produced 
without S knowing that his behavior has been 
manipulated. 

Since the data did not show a relationship 
between conditionability and initial level of 
either anxiety or emotional adjustment, no 
light is cast on the question of what personality 
variables in S facilitate or impede verbal 
conditioning. The results regarding anxiety 
conflict with those of Taffel (1955), whose data 
confirmed the thesis that anxiety connotes 
heightened responsiveness to stimuli. He 
found a positive relationship between manifest 
anxiety and conditionability. The fact that his 
Ss were hospitalized psychiatric patients while 
the present ones were “normals’’ may account 
for this discrepancy. Presumably, his Ss had 
higher anxiety, and the mean level of anxiety 
here may not have been of sufficient magnitude 
to exercise a discriminatory effect on condi- 
tionability. 

As Krasner stated (1955, p. 21), “It sounds 
like the simplest and most axiomatic thing to 
say about psychotherapy, but it would seem 
reasonable to assume that the factor of the 
therapist listening, paying attention, showing 
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some interest, is a basic and indispensable 
variable of the therapeutic situation.” From 
the experimenter’s experience with the control 
group, however, where he sat silently, making 
no response throughout the therapy, and where 
the majority of the Ss reported themselves so 
absorbed in their own verbalizations that they 
became oblivious of his presence, he is tempted 
to challenge this axiom. Certainly, it would be 
worth investigating whether or not Ss could 
verbalize as productively in the absence of an 
“interviewer” and whether the character of 
such verbalizations would differ systematically 
from those made with an interviewer present. 

An equally heretical notion is that a form 
of psychotherapy might be devised which did 
not require the “therapist” to acquire clinical 
knowledge and training but merely to learn 
how to administer simple reinforcements (like 
“Mm-hm”’) to specified classes of verbal re- 
sponse! If subsequent investigations were to 
disclose a significant relationship between con- 
ditioning of particular classes of verbal re- 
sponse in the interview and desirable changes 
in personality test scores, it might well be that 
a simple verbal conditioning technique could 
be advantageously employed for circumscribed 
therapeutic purposes. 

It may be argued that the quasi therapy was 
more quasi than therapy and that, therefore, 
the findings have little but analogic meaning 
for psychotherapy. Certainly, the situation 
was not structured to S as therapy, but rather 
as a laboratory experiment in personality. The 
Ss themselves had not, ostensibly, sought 
therapy. On the other hand, the task for the S 
was almost identical with that in psycho- 
therapy; to talk about his characteristics and 
traits and how he felt about them. Moreover, 
the behavior of the interviewer—listening 
attentively, nodding is head, saying 
“Mm-hm” at times, or saying nothing—was 
characteristic behavior for all therapists at 
least some of the time. Furthermore, the ex- 
periment did what other verbal conditioning 
experiments have not done, that is, dealt with 
areas of conflict (negative self-references), 
which is genuinely relevent to psychotherapy. 

The demonstration that experiments in 
operant conditioning can be performed in 
therapy-like situations suggests the extension 
of such experimentation to actual psychother- 
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apy. Tentative beginnings in this direction 
have been made with psychotics (Lindsley & 
Skinner, 1954; King, Merrell, Lovinger & 
Denny, 1957). It seems proven that at least 
some aspects of the complex phenomena re- 
presented by psychotherapy are amenable to 
behavioral analysis and experimental manipu- 
lation. Further investigations utilizing operant 
conditioning designs are indicated since such 
experimental designs appear to have heuristic 
value 


SUMMARY 


The purpose of this experiment was to test 
the hypotheses that an interviewer can produce 
changes in a S’s self-reference verbalizations 
by applying simple reinforcements, and that 
such reinforcements can alter a S’s concept of 
himself. 

The experiment consisted of setting up a 
quasi-therapy situation by conducting a series 
of free interviews with individual Ss in which 
operant self-reference verbalizations were rein- 
forced with a simultaneous “Mm-hm” and 
head-nod. All interviews were tape recorded. 
A battery of four personality tests, two 
designed to induce “‘set’”’ and two chosen as 
measures of anxiety and emotional adjust- 
ment, was administered prior and subsequent 
to the therapy. Finally, a structured interview, 
likewise tape recorded, was conducted to as- 
certain the S’s awareness of the conditioning. 

Ss were 36 male university students in an 
introductory psychology course, randomly 
assigned to two experimental groups (one 
conditioned for positive and the other for 
negative self-references) and to a control 
group (not conditioned). A total of 216 ten- 
minute therapy interviews (6 per S) were re- 
corded and a detailed content analysis of self- 
references made. Scoring reliabilities were 
established by correlating ratings of independ- 
ent judges. The results were as follows: 

1. Operant conditioning of verbal behavior 
in a quasi-therapy situation was demonstrated 
by establishing, beyond the .01 level, that 
negative self-references could be increased by 


interpolating simple reinforcing stimuli and 
that such conditioning could be accomplished 
without the S’s awareness. It was further 
shown, beyond the .01 level, that whereas 
positive self-references without reinforcement 
tended to extinguish themselves in quasi- 
therapy, with reinforcement their extinction 

could be arrested, this too without the S’s 

awareness. 

2. Conditionability was not related to initial 
level of either anxiety or emotional adjustment. 

3. Successful conditioning of self-references 
in therapy did not modify self-references out- 
side of therapy or alter scores on tests of 
anxiety or emotional adjustment, indicating 
that the effect of the reinforcing stimuli was 
confined to therapy itself. 

4. Consistency of personality was shown by 
Ss’ unconditioned responses in therapy and in 
psychological tests. 
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FEAR OF FAILURE AND UNREALISTIC VOCATIONAL ASPIRATION! 


CHARLES H. MAHONE 


Purdue University 


HIS study is concerned with the applica- 

tion of a theory regarding fear of failure 

to the problem of vocational choice. 
Atkinson’s usage (1958b) is followed in regard 
to fear of failure, which is defined as a motiva- 
tional disposition to be anxiously concerned 
about avoiding failure. This disposition has 
been measured by two kinds of instruments: 
(a) the self-report measure of achievement- 
related anxiety, such as the Mandler-Sarason 
Test Anxiety Scale (Mandler and Sarason, 
1952) or the Alpert Achievement Anxiety 
Scale (Alpert, 1957); and (6) the scoring of 
thematic apperception stories for achievement 
imagery (McClelland, Atkinson, Clark, & 
Lowell, 1953). In the latter case, fear of failure 
is inferred from low n Achievement? scores, 
for achievement anxiety is likely to be higher 
than positive achievement motivation for 
this group (Atkinson, 1958b, p. 332). 

Two of the major factors that determine a 
person’s suitability for a particular vocation 
are ability and interest. Therefore, if one is to 
be considered realistic in his choice of occupa- 
tion, he should have ability commensurate 
with that required for substantial success in 
the occupation, and his interest pattern should 
be congruent with that of most people in his 
chosen field. The thesis of this paper is that 
persons who are fearful of failure tend to be 
unrealistic in their vocational choice with 
respect to both ability and interest. 

With respect to the factor of interest, at 
least two studies (Atkinson, 1953; McClelland 
& Liberman, 1949) have shown that persons 
with relatively high fear of failure (as inferred 

1 This article has been adapted from a dissertation 
submitted in partial fulfillment of the requirements for 
the degree of doctor of philosophy at the University 
of Michigan (Mahone, 1958). The author wishes to 
thank John W. Atkinson for his support and guidance 
in the research and in the preparation of this report. 
The author is also appreciative of the grant made for 
this study by the H. H. Rackham School of Graduate 
Studies at the University of Michigan. 

* The abbreviation n Achievement stands for the 
need for achievement or the achievement motive. These two 
terms are used interchangeably as synonymous through- 
out the text. 
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from low n Achievement) tend to avoid con- 
sideration of achievement-related information. 
Fearful persons should therefore show a rela- 
tive lack of relevant information concerning 
the kinds of interest satisfaction to be found 
in the various occupational areas. They should 
thus be likely to choose occupations only re- 
motely related to the kinds of gratifications 
that they desire and expect to find in their 
vocations. 

As for the ability factor, two considerations 
underlie the prediction. As in the case of 
interest, the fearful person may be expected 
to lack information concerning his own ability 
and that required for his choice of occupation. 
The second reason is based on theory regard- 
ing the level of aspiration of the fearful person. 
Atkinson (1958b) has offered a theoretical 
model for predicting level of aspiration from 
the relative strengths of fear of failure and 
need for achievement. He has suggested that 
for persons with higher positive than negative 
achievement motive, the aroused motivation 
to approach success is at a maximum when the 
achievement task is of intermediate difficulty. 
For persons with relatively high negative 
achievement motive (i.e., fear of failure), the 
aroused motivation to avoid failure is also at 
a maximum when the task is of intermediate 
difficulty. If both a person’s tendency to ap- 
proach success and his tendency to avoid 
failure are simultaneously aroused, as they 
are in competitive achievement situations, 
his behavior when confronted by a task of 
intermediate difficulty clearly depends upon 
which motive is stronger. 

Applying this theory to level of aspiration, 
the fearful person (more strongly motivated 
to avoid failure than to achieve success) 
should tend either to overaspire or to under- 
aspire (i.e., to avoid the intermediate range of 
the risk continuum). This prediction has been 
confirmed in several studies of risk-taking 
behavior in the experimental laboratory on 
simple competitive games (Atkinson, 1958c; 
Atkinson & Litwin, 1958; Clark, Ricciuti, & 
Teevan, 1956; Litwin, 1958; McClelland, 
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1958). As yet, however, there is no theoretical 
basis for differentially predicting under- vs. 
overaspiration. 

Let us regard the level of ability required 
by a person’s vocational choice as an index of 
his level of aspiration. The fearful person may 
then be expected to aspire either to an occu- 
pation which he feels very confident about at- 
taining (underaspiration) or to an occupation 
which he considers very difficult to attain 
(overaspiration). That is, he should avoid 
the choice of an occupation that represents an 
intermediate risk of failure for him—the most 
“realistic” choice in the sense that it falls 
within the ‘“‘most probable” range of achieve- 
ment (Lewin, Dembo, Festinger, & Sears. 1944). 
It is in this sense that the person fearful of 
failure is likely to be “unrealistic” with re- 
spect to ability in his vocational choice. 

The following hypotheses are therefore of- 
fered: 

1. A person who is fearful of failure tends to 
choose a vocation that is unrealistic in terms 
of the discrepancy between his own ability 
and the ability judged suitable for the occupa- 
tion to which he aspires. 

This is the most general hypothesis regard- 
ing ability, and the confirmation of this hy- 
pothesis should constitute evidence consistent 
with the parts of the theory concerning risk- 
taking and/or lack of information. 

2. A person who is fearful of failure tends to 
choose a vocation that is unrealistic in terms 
of the discrepancy between his own perceived 
ability and the level of ability that he perceives 
as being suitable for the occupation to which 
he aspires. 

This hypothesis rules out consideration of 
amount and accuracy of information, and the 
confirmation of this hypothesis should consti- 
tute evidence consistent with the risk-taking 
model. 

3. A person who is fearful of failure tends 
to be inaccurate in his estimation of his own 
ability. 

This hypothesis rules out consideration of 
risk, and the confirmation of this hypothesis 
should constitute evidence supporting the as- 
sumptions concerning lack of information as a 
determinant of unrealistic vocational choice. 

4. A person who is fearful of failure tends to 
choose a vocation that reflects a pattern of 
interests discrepant from his own. 


Since interest does not involve risk-taking 
behavior, the confirmation of this hypothesis 
should constitute evidence supporting the as- 
sumptions concerning lack of information as a 
determinant of unrealistic vocational choice. 


METHOD 


Subjects 


The Ss were 135 male college students drawn 
from introductory courses in psychology, French, and 
mathematics at the University of Michigan. They were 
volunteers from a larger group of approximately 330 
male Ss who had participated in another research 
project which involved the administration of the 
thematic apperceptive measure of n Achievement and 
the Alpert Achievement Anxiety Scale. In addition, 
the present Ss were asked to answer the Strong Voca- 
tional Information Blank and a vocational information 
questionnaire. 


Vocational Information Questionnaire 


This questionnaire consisted of a series of questions 
designed to tap various factors associated with the S’s 
vocational choice. The most relevant items of infor- 
mation for the present study were obtained from ques- 
tions that asked S to state: (a) his vocational objective; 
(6) an estimate of the ability required for each of a 
number of listed occupations; (c) an estimate of his 
own general ability relative to that of his fellow stu- 
dents; (d) what his vocational choice would be if ability 
were not a limiting consideration; and (e) which occu- 
pation in the above-mentioned list of occupations re- 
sembled most closely his own vocational choice. On the 
basis of pretest information on the vocational choices 
of a sample of 200 college students, the list of occupa- 
tions was formulated so as to include, if not the actual 
stated vocational choice, at least a closely related occu- 
pation to that chosen by each S. It was the S’s estimate 
of the level of ability required for this listed occupation 
which constituted the measure of estimated ability 
required for own vocational choice 


n Achievement Scale 


Fear of failure has been defined conceptually as a 
relatively stronger disposition to avoid failure than to 
approach success. In order to assess this variable, it was 
necessary to use measures of both positive and negative 
achievement motivation. The n Achievement test, the 
measure of positive achievement motivation, consisted 
of a set of six pictures recently included in a national 
survey by the Survey Research Center at the Univer- 
sity of Michigan. The pictures were selected so that the 
stories could be scored for the achievement, affiliation, 
and power motives (Atkinson, 1958a). The test was 
administered under neutral classroom conditions 
(McClelland et al., 1953). The stories were scored for 
n Achievement by three trained scorers, each of whom 
scored stories to two pictures. A check on the reliability 
of scoring was made by having another experienced 
scorer score all six stories for a group of 30 Ss. The rank 
order correlation between the latter scorer’s total 


vn 


id 
n, 


yn 


FEAR OF FAILURE AND UNREALISTIC VOCATIONAL ASPIRATION 255 


scores and the combined scores of the three original 
scorers was +.80. The mean n Achievement score for 
the present sample was 5.0, and the best median cutting 
point for both the larger group as a whole and the 
present sample was between 4 and 5, which placed 67 
Ss above the median (High n Achievement) and 68 
Ss below the median. 


Debilitating Anxiety Scale 


The measure of achievement anxiety (motivation 
to avoid failure) used was Alpert’s Debilitating Anxiety 
Scale (Alpert, 1957). This scale is a part of Alpert’s 
more general Achievement Anxiety Scale, which was 
designed in an attempt to refine the Mandler-Sarason 
Test Anxiety Scale by differentiating debilitating and 
facilitating anxiety. An early validation study on this 
instrument showed the Debilitating Anxiety Scale to 
correlate +.79 (N = 40) with the Test Anxiety Scale 
(N = 40), and, like the latter, to correlate negatively 
with college grades and performance on achievement 
and aptitude tests (Alpert, 1957). 

The Debilitating Anxiety Scale measures S’s aware- 
ness of the extent to which anxiety interferes with the 
efficiency of his performance in achievement test situa- 
tions. An illustrative item is: “‘In a course where I have 
been doing poorly, my fear of a bad grade cuts down 
my efficiency.” The scale consists of ten such items, and 
S is asked to respond by indicating, on a five-point 
scale, how often he experiences the described situation 
A score of 50 represents the highest degree of debili- 
tating anxiety. For the present sample scores ranged 
from 12 to 39 with a mean of 25.9, and the best median 
cutting point was between 25 and 26. The best median 
cutting point for the larger group, however, was be- 
tween 26 and 27. Since the larger group was more 
heterogeneous and representative of the college popu- 
lation as a whole, the latter cutting point was used, 
yielding 59 Ss in the present sample who scored above 
the median (High Anxiety) and 76 Ss who scored below 
the median. 

The product-moment correlation between n Achieve- 
ment and Debilitating Anxiety was +.09 (NV = 135). 
When Ss were grouped according to their px sition above 
or below the median score on both n Achievement and 
Debilitating Anxiety, 31 Ss were classified as High n 
Achievement-High Anxiety, 36 as High n Achievement- 
Low Anxiety, 28 as Low n Achievement-High Anxiety, 
and 40 as Low n Achievement-Low Anxiety. 


RESULTS 


Judgments of Unrealistic Vocational Choice 

(With Respect to Ability) 

The first hypothesis requires some evalua- 
tion of the discrepancy between S’s general 
ability and the ability required in order to 
achieve his vocational objective. Because no 
adequate information was found in the litera- 
ture concerning the level of ability required in 
various occupations, clinical judgments were 
employed as the major criteria in determining 
realism of vocational choice with respect to 


ability. ‘Two clinical psychologists, both having 
considerable experience in vocational counsel- 
ing, made independent judgments of the real- 
ism of the vocational choice of each of the Ss 
on the basis of the following information: 
ACE score (total of the verbal and quanti- 
tative scores expressed as a percentile), grade 
point average for the preceding semester, 
class, major field, vocational choice, and 
“ideal” choice (i.e., the vocational choice that 
would be made if ability were not a considera- 
tion). The information on “ideal” choice was 
included to permit judgments of ‘“under- 
aspiration” in cases where the student was 
judged to have sufficient ability for a particu- 
lar vocation, even though he rejected it be- 
cause of perceived lack of ability. 

The judges were asked to assign each S to 
one of five categories: Probably Over-Aspiring, 
Perhaps Over-Aspiring, Probably Realistic, 
Perhaps Under-Aspiring, Probably Under- 
Aspiring. They were also asked to indicate 
whether they were Very Confident, Moderately 
Confident, Moderately Doubtful, or Very 
Doubtful about their assignments in each 
case. The investigator, who has had three 
years experience as a trainee in vocational 
counseling, made independent judgments for 
each S to provide a basis for resolving dis- 
agreements between the two expert judges. 

The three judges were in agreement with 
one another in 71-81% of the cases in judging 
the S to be either “realistic,” “‘overaspiring,”’ 
or “underaspiring”. With the investigator 
making the deciding judgment in case of dis- 
agreement between the two expert judges, 
the final ratings consisted of 80 Ss judged to 
be realistic, 45 judged to be overaspiring, and 
10 judged to be underaspiring. Each of the 
three judges agreed with these final ratings in 
at least 81% of the cases. In the analysis, 
ratings of overaspiration and underaspiration 
were combined in a single “‘unrealistic” cate- 
gory. 

According to the first hypothesis, Ss who 
are high in achievement motivation and low 
in anxiety about failure should be judged more 
realistic in aspiring to vocations that are com- 
mensurate with their ability than Ss who are 
low in achievement motivation and high in 
anxiety. The relative frequency of realistic 


3 American Council on Education test of verbal and 
quantitative aptitude. 
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and unrealistic persons in the various motiva- 
tional groups are all in the predicted direction 
(Table 1). A highly significant difference is 
found between the High n Achievement-Low 
Debilitating Anxiety group and the Low n 
Achievement-High Debilitating Anxiety group 


(x? = 7.96, df = 1, p < .003). Debilitating 
Anxiety alone also discriminates between 
Realistic and Unrealistic groups (x? = 10.1, 


df = 1, p < .001), but n Achievement alone 
does not. 


TABLE 1 
n ACHIEVEMENT AND DEBILITATING ANXIETY RELATED 
TO ALL CLINICAL JUDGMENTS OF REALISTIC 
AND UNREALISTIC VOCATIONAL CEOICE 
(N = 135) 


Clinical 
Judgments 


Cinical 
Judgments 
on Ach Anxiety JN — _ —_ 
Real- Unreal-|Anxiety N peal- Unreal- 


istic istic istic istic 
r~ or : ~ 
t 0 t 
High High 31 44 $2 High 59 44 56 
High Low 36 75° 25° Low 76 71 29 
Low High 28 39° s° x? = 10.1 
Low Low 40 8 32 p < .001 
* x? = 7.96 n Ach 
> < .003 High 67 623 37 
Low 68 56 44 
x? = 1.09 
@<.iS 


* Significance levels reported in this table are based on one 


tailed tests since the direction of the differences was predicted 
rABLE 2 
n ACHIEVEMENT AND DEBILITATING ANXIETY RELATE! 
TO CONFIDENT, AGREED-UPON CLINICAL JUDG 
MENTS OF REALISTIC-UNREALISTIC 


VOCATIONAL CHoice (V = 58) 
Clinical Clinical 
Debil Judgments Debil Judements 
a Ach Anxiety Anxiety . 
—_ Real- Unreal y Real- Unreal- 
istic istic istic istic 
Cd C; oY o 
High High ? 56 44 High 21 33 67 
High Low 18 “4° 6° Low 37 80 20 
Low High 12 17° 83° x? = 15.0 
Low Low 19 ‘ 2 » < .0005 
n Ach 
*x? = 15.5* High 27 81 19 
» < 0005 Low $1 52 48 
x? = 5.68 
?< 01 


* Significance levels reported in this table are based on one 
tailed tests since the direction of the differences was predicted 

* Yates correction was used in obtaining this chi square value 
because of the small V 
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Also, since the clinical judgments are obvi- 
ously a rather gross and imperfect measure, a 
more refined analysis was made using only 
ratings on which both expert judges were not 
only in agreement but rated themselves as 
being either “Very Confident” or “Moderately 
Confident.” There were 58 judgments, includ- 
ing 38 “‘realistic’”’, 19 ‘‘overaspiring”, and one 
“underaspiring”’. 

In this more rigorous analvsis (Table 2), n 
Achievement by itself is significantly associ- 
ated with judged reality of vocational choice 
(x? = 5.68, df = 1, p < .01). The degree of 
association between unrealistic aspiration and 
Debilitating Anxiety is also very much en- 
hanced (x? = 15.0, df = 1, p < .0005), and the 
difference between the High n Achievement- 
Low Anxiety and the Low n Achievement- 
High Anxiety groups is striking (x? = 15.5,‘ 
df = 1, p < .0005). 

It may be noted in Table 2 that when Ss are 
classified according to both n Achievement and 
anxiety, the distinctive group is the one low in 
n Achievement and high in Anxiety, which is 
clearly more unrealistic than the other three 
groups combined (x? = 12.0‘, df = 1, p < 
0005). 


Subjective Goal Discrepancy 

The second prediction was that Ss who are 
fearful of failure should avoid choosing occu- 
pations that they perceive as requiring the 
same level of ability as they perceive in them- 
selves. Instead they should favor occupations 
that they perceive as requiring either more or 
less than their own perceived ability. In order 
to test this hypothesis, it was necessary to 
obtain a measure of each S’s estimate of his 
own ability and his estimate of the level of 
general ability necessary in order to attain 
his occupational goal. On the initial question- 
naire all Ss were asked: ““‘What percentage of 
the undergraduates at the University of Michi- 
gan do you feel have sufficient general ability 
(i.e., verbal and quantitative ability) to at- 
tain the following occupational goals, provided 
they were motivated to do so?” As a measure 
of their perception of their own ability, they 
were also asked: “Irrespective of your own 
grade point average, where do you think you 

‘Computed with Yates correction because of the 
small NV. 
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stand in relation to your fellow students in 
general ability?’ Both of these estimates were 
made relative to “fellow students” so that 
they would be comparable. 

The subjective goal discrepancy, then, is the 
difference between the S’s estimate of his 
own ability and his estimate of the ability re- 
quired for his vocational objective. The for- 
mula for the discrepancy score is the per- 
ceived percentage of students with higher 
ability than S minus the perceived percentage 
of students with sufficient ability to attain S’s 
vocational goal. The range of discrepancy 
scores was from —48 (own ability seen as 
superior to that required by job) to +85 (own 
ability seen as inferior to that required by 
job). The median discrepancy score was +7 
if the direction of discrepancy is taken into 
account. If only the absolute size of the dis- 
crepancy is considered, the median discrepancy 
is +12. 

This index is somewhat problematic, how- 
ever, in that it raises the question of whether 
the best test of the hypothesis is made using 
the median cutting point which takes into 
consideration the direction of the discrepancy, 
or the median cutting point which disregards 
direction. If one assumes that the Ss are re- 
sponding to the question of estimating ability 
required by various occupations by estimating 
the average ability required, then the theoreti- 
cal model for risk-taking behavior would 
certainly call for the cutting point which dis- 
regards the direction of discrepancy, since this 
estimate would be analogous to an estimate of 
intermediate risk, and thus large negative dis- 
crepancies would represent underaspiration. 
If however, the Ss are making estimates in 
terms of the minimum ability required, then 
this kind of estimate provides no basis for de- 
termining underaspiration, for occupations 
probably differ markedly in the range of 
ability levels which they can productively 
use. In this case negative discrepancies would 
have to be regarded as realistic, regardless of 
size. Since the basis on which Ss arrived at 
their estimates is not known, analysis of the 
data was made both considering and disregard- 
ing the direction of the discrepancy. 

When the direction of the discrepancy was 
considered, the median cutting point was +8, 
in terms of which 71 Ss were classified as 
“realistic”, and 63 were classified as ‘‘un- 


TABLE 3 
n ACHIEVEMENT AND DEBILITATING ANXIETY RELATED 
TO Positive Supyective Goat D1s- 
CREPANCY (OVERASPIRATION) 


Sana Goal = 
Jiscrepancy | j ~ 
ath Aeckty B Anxiety N Discrepancy 








High Low High Low 
% % % % 


High High 30 $3 47 High 58 7 43 
High Low 36 31° 69° Low 76 39 61 
Low High 28 61° ; x? = 3.97 
Low Low 40 48 52 ’ < .03 
n Ach 
*y? = 5.69 High 66 41 59 
’< .01 Low 68 53 47 
x? = 1.92 
’ < .10 


* Significance levels reported in this table are based on one- 
tailed tests since the direction of the differences was predicted. 


realistic’, which, in this case, means only 
overaspiring (i.e., positive discrepancies®). 
When the direction of the discrepancy was 
disregarded, the median cutting point was 
+13, in terms of which 68 Ss were classified 
as ‘realistic’, and 66 were classified as ‘“‘un- 
realistic’, which, in this case, includes both 
overaspiring (positive discrepancies) and un- 
deraspiring (negative discrepancies) Ss. 

In the positive discrepancy analysis (Table 
3), the hypothesis is confirmed for the combi- 
nation of n Achievement and Debilitating 
Anxiety (x? = 5.69, df = 1, p < .01). De- 
bilitating Anxiety alone is also significantly 
related to this index of overaspiration (x? = 
3.97, df = 1, p < .03), but n Achievement 
alone is not. In the absolute discrepancy 
analysis (both over- and underaspirants con- 
sidered unrealistic), none of these associations 
reached the .05 level of significance when the 
distribution of absolute discrepancies is di- 
vided at the median +13. However, if we 
accept a more stringent definition of the 
“realistic” category which includes only the 
least discrepant third of the distribution of 
absolute goal discrepancies (+8), as suggested 
in a study by Atkinson and Litwin (1958), 
then Debilitating Anxiety alone (x* = 5.36, 
df = 1, p < .02) and the High n Achievement- 
Low Anxiety vs. Low n Achievement-High 
Anxiety comparison (x? = 7.0,df = 1,p < 

5 The total N here is only 134 because one S failed 
to provide an estimate of his own ability. 
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TABLE 4 


n ACHIEVEMENT AND DEBILITATING ANXIETY RELATED 
To ABSOLUTE SuByectTIvE Goa. Discrepancy (IN- 
TERMEDIATE THIRD vs. COMBINED Pos!- 

TIVE & NEGATIVE EXTREMES) 


Absolute 
Goal 
Discrepancy 


Absolute 
Goal 
Discrepancy 


n Ach Anxiety JN Anxiety N / 
Mid Ex Mid Er- 
third tremes third tremes 

High High 0 0 70 High a 24 7% 

High Low ue so* o* Low 6 43 7 

Low High 28 18° 32° xt = 5.36 

Low Low ue 8 62 p< 0? 

n Ach 
*? 0 High 66 41 59 
P< .& Low 68 29 i 
x? = 2.07 
p 10 


* Significance levels reported in this table are based on one 


tailed tests since the direction of the differences was predicted 
.005) show highly significant relationships with 
goal discrepancy (Table 4). n 
Achievement alone does not reliably dis- 
criminate between the mid-third realistic 
group and the over- and underaspirant groups 
combined. However, the trends are all in the 
predicted direction, and it is very clear that 
the only group heavily represented in the 
“realistic” category is the High n Achieve- 
ment-Low Anxiety group. 


subjective 


Inaccuracy in Estimation of Own Ability 

In the initial questionnaire, Ss were asked 
to estimate their standing in general ability in 
comparison with fellow students, in terms of 
percentile rank, which would then be directly 
comparable to Ss’ actual percentile scores for 
combined verbal and quantitative ability on 
the ACE. One measure of unrealism (or in- 
accuracy) is the discrepancy between the 
percentile ranks of the S’s estimate of his 
ability and his actual ability; the latter per- 
centile rank was subtracted from the former. 
The ranks for measured ability ranged from 
the 4th to the 99th percentile, with a median 
at the 61st percentile. The percentile discrep- 
ancies ranged from —47 (underestimation) 
to +75, with a median absolute discrep- 
ancy of +16. There were 35 negative dis- 
crepancies and 99 positive discrepancies*. The 

* The total N here is only 134 because one S failed 
to provide an estimate of his own ability 
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median cutting point was between +15 and 
+16, yielding 68 Ss classified as “inaccurate’’ 
and 66 Ss classified as ‘‘accurate” in estimat- 
ing their own ability. 

The third hypothesis states that Ss with 
high achievement motivation and low anxiety 
are more accurate in estimating their own 
general level of ability than are Ss with low 
achievement motivation and high anxiety. 
This prediction was confirmed (x? = 3.52, 
df = 1, p < .04). Debilitating Anxiety and n 
Achievement taken singly were not signifi- 
antly related to accuracy of estimate (Table 
). The major difference in Table 5 is between 
the Low n Achievement-High Anxiety group 
and the other three groups combined (x? = 
4.6, df = 1, p < .02). 


c 
5 


Unrealistic Vocational Choice (With Respect to 

Interest) 

At the same time that the judges made 
their rating of the realism of vocational choice 
with respect to ability, they also indicated the 
primary and secondary vocational areas on 
the Strong interest test which they felt would 
reflect the interest pattern most congruent 
with the S’s vocational choice. On the basis 
of the S’s scores in these designated areas, as 
evaluated in terms of the pattern analysis sug- 
gested and used by Darley (1941), plus his 
score on the specific occupational key related 
to his vocational choice, the investigator 
divided the Ss into two approximately equal 
groups. One group (NV = 66) was considered 
to be nondiscrepant in their interest patterns 


TABLE 5 


n ACHIEVEMENT AND DEBILITATING ANXIETY RELATED 
TO INACCURACY IN EsTIMATE OF OwN ABILITY 


Inaccuracy Inaccuracy 
n Ach Anxiety JN Hish Lew Anxiety N Hich Low 
c c a / 
High High 30 46 54 High 58 57 43 
High Low 36 44° 56° Low 76 46 54 
Low High 28 67° 33° x? = 1.41 
Low Low 40 47 53 o< .15 
n Ach 
*y? = 3.52 High 66 45 
’< .04 Low 68 56 44 
x? = 1.46 
&’< .15 


* Significance levels reported in this table are based on one 
tailed tests since the direction of the differences was predicted 


a 


— 


+ =e tinct 





— 


eo A ne one 


nA 


(i.e 
wit 
(N 
(i.e 
wit! 
n 
nifi 
inte 
(x? 
diff 
Ack 


real 
dire 
are 

leve 
be : 
mer 
Sub 
shir 
and 
pect 
ship 
Dis 


3.00 


) and 
rate” 
imat- 


with 
xiety 

own 
1 low 
ciety. 
3.52, 
ind n 
gnifi- 
able 
ween 
TOU] 
x? . 


nade 
hoice 
1 the 
Ss on 
ould 
uent 
basis 
S, as 
sug- 
; his 
ated 
rator 
qual 
ered 
erns 


ted 


= 


Ser 


—— 


FEAR OF FAILURE AND UNREALISTIC VOCATIONAL 


TABLE 6 

n ACHIEVEMENT AND DEBILITATING ANXIETY RELATED 
TO REALISTIC-UNREALISTIC VOCATIONAL 
Cuoice (with Respect TO INTEREST) 


Interest Interest 


nAch Anxiety N  Real- Unreal- Anxiety N Real- Unreal- 


istic istic istic istic 
Y Oo % % 
High Higt 31 61 39 High 59 47 53 
High Low 36 56° 44° Low 76 50 50 
Low High 28 32° 68° x? = .12 
Low Low 40 45 55 >< .40 
n Ach 
*,2 = 3.50 High 67 58 42 
>< .04 Low 68 40 60 
x? = 4.55 
’ < .02 
* Significance levels reported in this table are based on one- 
tailed tests since the direction of the differences was predicted 
(i.e., inventoried interests were congruent 


with vocational choices) and the other group 
(VY = 69) was considered to be discrepant 
(i.e., inventoried interests were not congruent 
with vocational choices). 

n Achievement alone was found to be sig- 
nificantly related to congruence (or reality) of 
interest pattern in the expected direction 
(x? = 4.55, df = 1, p < .02), and a significant 
difference was also found between the High n 
Achievement-Low Anxiety group and the Low 
n Achievement-High Anxiety group (x? = 
3.50, df = 1, p < .04). Anxiety alone, however, 
was not significantly related to realism of 
interest pattern (Table 6). 


Relationships Among Indices of Unrealism 


The relationships among the indices of un- 
realism about ability are all in the expected 
direction, and three of the five comparisons 
are statistically significant beyond the .05 
level. The dimensions that do not appear to 
be significantly related are the Clinical Judg- 
ments and both the Positive and the Absolute 
Subjective Goal Discrepancy. The relation- 
ships between the unrealistic interest index 
and the ability indices are also all in the ex- 
pected direction, but only one of the relation- 
ships with Interest, that of Absolute Goal 
Discrepancy, is statistically significant (x? = 
3.00, df = 1, p < .05). 


ASPIRATION 259 
Ability, Fear of Failure, and Indices of Un- 
realism 

The results show that measured ability 
(ACE percentile score) is significantly related 
(negatively) to Debilitating Anxiety (x? = 
11.8, df = 1, p < .001), but is not related to 
n Achievement. Ability is also significantly 
associated with all of the indices of unrealism 
related to ability (p < .005 in all cases). In 
every case the group low in ability is typically 
more unrealistic than the group high in ability. 
However, ACE percentile score is not signifi- 
cantly related to Interest Reality. 

DISCUSSION 

Because of the association between high 
measured ability and low Debilitating Anxiety, 
it is very difficult to distinguish between the 
relationships of ability and those of motivation 
to the three ability-related dimensions of un- 
realistic vocational aspiration. This kind of 
distinction is desirable, however, in view of 
the possibility that the relationships of ACE 
score to Clinical Judgments and to Inaccuracy 
in Estimating Own Ability are artifacts re- 
sulting from the fact that measured ability 
was directly involved in arriving at these two 
indices. 

In view of this possibility, it is important 
to determine whether or not fear of failure is 
related to Clinical Judgments and Inaccuracy 
in Estimating Own Ability independent of 
level of ability. For this reason, an analysis of 
covariance was accomplished to determine 
whether the average level of anxiety in 
“Realistic” and “Unrealistic” groups was dif- 
ferent (as predicted) when the groups were 
equated by this statistical procedure on ACE 
score. The results clearly show that fear of 
failure is related to Clinical Judgments (V = 
135) when ability is held constant (F = 5.40; 
df = 1,132; p < .02'). This means that it 
would not be possible to account for the rela- 
tionship between fear of failure and judgments 
of unrealistic vocational aspiration solely in 
terms of a possibly artifactual relationship 
between judgments of unrealistic aspiration 
and ability. An analysis of covariance was not 
made using the Inaccuracy index to categorize 
realistic and unrealistic groups because an 
inspection of the chi square tables indicated 


7 One-tailed test. 
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that no relationship between this index and 
the motivation measures remained when Ss 
high and low on the ACE were considered 
separately. Most of the relationship between 
this index and the fear of failure measure re- 
ported in Table 6 should be attributed to the 
relationship of both kinds of variables to the 
general ability measure. 

Let us now consider the relationship between 
low intelligence and high anxiety about failure, 
since the interaction of these two variables is 
probably of substantial importance in the 
prediction of unrealistic vocational aspiration. 
One reason why we might expect these vari- 
ables to be related is that relatively low in- 
telligence may cause a person to have more 
early failure experiences and thus to become 
more fearful of failure. A second and perhaps 
even more important factor is that anxiety 
about achievement has on numerous occa- 
sions been shown to interfere with performance 
on aptitude tests (Alpert, 1957; Sarason & 
Mandler, 1952; Zweidelson, 1956). Since our 
measure of ability, the ACE test, is just such 
a performance test taken under competitive 
conditions, we should expect that high anxiety 
would lead to performance decrements, i.e., 
lower ACE test scores. For this reason, when 
we control for measured ability in the analysis 
of covariance we are probably overcontrolling. 
Not only are we ruling out the effect of in- 
nate ability on fear of failure, but we are in all 
likelihood also ruling out the effect of fear of 
failure on the performance measure of innate 
ability. Thus, we have good reason to believe 
that the relationship of fear of failure to un- 
realistic vocational aspiration, independent of 
the effect of innate ability, is even stronger 
than the results of the analysis of covariance 
show it to be. 

The results of the present study confirm the 
applicability of the principles found in the 
laboratory study of level ef aspiration to 
aspiration in real life. 


SUMMARY 
The purpose of this study was to determine 
whether fear of failure in college students is a 
determinant of unrealistic vocationa: aspira- 
tion. 
Vocational aspiration is considered in the 
context of theory having to do with motiva- 


CHARLES H. MAHONE 


tionai determinants of level of aspiration. 
Persons who are fearful of failure are pre- 
sumed to be generally avoidant in their be- 
havior in competitive achievement situations. 
On the one hand, such a person should avoid 
consideration of achievement related informa- 
tion, and, on the other hand, he should prefer 
speculative ventures, where his probability of 
success is quite low (overaspiration), or safe 
ventures, where his probability of success is 
quite high (underaspiration). In contrast, per- 
sons who are relatively strong in motivation 
to achieve success should tend to prefer ven- 
tures where the probability of success is inter- 
mediate. 

Measures of both achievement motive and 
achievement-related anxiety were used in the 
present study. The Ss were 135 male college 
students. They were classified as “realistic’”’ or 
“unrealistic” in vocational aspiration in terms 
of each of four criteria: (¢) judgments by 
clinical psychologists based on the discrepancy 
between the S’s own measured ability and the 
ability judged to be required by the S’s voca- 
tional choice; (6) the discrepancy between the 
S’s estimate of his own ability and his estimate 
of the ability required to attain his vocational! 
goal; (c) the S’s inaccuracy in estimating his 
own ability; and (d) the discrepancy between 
the S’s interest pattern and his vocational 
aspiration. On each criterion of realistic vs. 
unrealistic vocational aspiration, significantly 
more Ss who were low in achievement motiva- 
tion and high in achievement-related anxiety 
were classified as unrealistic, than Ss who were 
high in achievement motivation and low in 
achievement-related anxiety. Thus, the major 
hypothesis was clearly supported. 
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THREE VERSIONS OF THE F SCALE AND PERFORMANCE 
ON THE SEMANTIC DIFFERENTIAL! 
ROBERT E. MOGAR 


State University of Iowa 


N RECENT years inferences of authoritarian 

personality trends from the California F 
Scale have been challenged by a number 
evidence has 


of investigators. Considerable 


accumulated to suggest the importance of 
determinants of F scale performance other than 
attitudes. attention 


has been called to acquiescence response set 


authoritarian Particular 


because of unidirectional wording and scoring 
of the scale (Bass, 1955; Chapman & Camp- 
bell, 1957; Christie, Havel, & Seidenberg, 
1958; Jackson & Messick, 1957; and Leavitt, 
Hax, & Roche, 1955). In studies the 
investigators used reversed | 
so that items in the reversed scales were pre- 


t he se 


scales written 


sumably opposite in meaning from items in the 
Although all of these studies 
seem to indicate that a response set to acqui- 
esce, independent of attitudinal content, plays 


original scale. 


some role in determining scores on the F scale, 
there has been disagreement on the relative 
importance of acquiescence and the role it 


plays in reiation to authoritarianism. 
In a comprehensive critique of this body of 
iristie et al., (1958) conclude that 


resear¢ h, ( 
the lack of consistent positive correlation be- 
tween the F scale and several independently 


constructed reversed F should not be 


scales 
attributed to a tendency to acquiesce. They 
state that the lack of consistent positive corre- 
lation is more a function of inadequate reversal 
so that reversed items may be assessing differ- 
ent rather than opposing content. Brown and 
Datta (1959 
support Christie’s contention. They found that 
correlates of intelligence remain significantly 
related to authoritarianism even with response 


present evidence that seems to 


set removed 
It appears as though the relative importance 
of acquiescence in F scale performance cannot 


be evaluated by using reversed scales until it is 


! This paper is based on a portion of a master’s thesis 
submitted to the Department of Psychology of the 
State University of Iowa. Grateful acknowledgments 
are extended to Alfred B. Heilbrun, Jr., thesis advisor, 
and Harold P. Bechtoldt, who provided invaluable sug 
gestions 


clear that reversed items are comparable to 
the original items in all respects but direction 
of wording. The purpose of the present study 
is to evaluate the adequa¢ y of reversed F 
scales from the standpoint of equivalence and 
validity. Equivalence would require a correla- 
tion between reversed scales approa¢ hing their 
respective reliabilities as well as correlations 
with the original F scale in the same direction 
and of roughly the same magnitude. Validity 
demands significant the re- 
versed F scales with an external criterion of 


correlations of 


authoritarian tendencies 


METHOD 
Authoritarian scales 


Of the five reversed F scales reported in the litera 
ture, the 1955) and Jackson-Messick (1957 
versions seem to represent the most widely varying 


Bass 


approaches to item revision. Bass made relatively minor 
modifications of the content and structure of the 
original items while Jackson and Messick made con 
siderably greater modifications in accordance with their 
desire to preserve only the style of the original state 
ments 

Possible scores on the F and reversed F scales range 
from 1.00 to 7.00 with high scores on eachscale indicating 
authoritarianism. Accordingly, a positive correlation 
between any two scales indicates consistency of re 
sponse to content 


Criterion measure 


The semantic differential consists of a set of con 
cepts Homosexual) which are rated upon a 
list of bipolar Intelligent- 
Stupid). Twenty scales were developed on the basis of 
Osgood’s (Osgood & Suci, 1955) analysis of “meaning” 
and his criteria of selection (Osgood, Suci, & Tannen 
baum, 1957, pp. 76-78). To prevent the formation of 
position preferences, the polarity direction of the 20 
scales was varied throughout the list with respect to 
favorability (e.g., Intelligent-Stupid was followed by 
Bad-Good). 

A postulated tendency of the authoritarian person 
ality is to regard group differences in “all-or-nothing”’ 
terms rather than in terms of gradations (Adorno, 
Frenkel-Brunswick, Levinson, & Sanford, 1950, p 
228), and there is some empirical evidence to support 
this contention (Stagner & Osgood, 1946). Therefore, 
the authoritarian might be expected to justify differ- 
ences by attributing unfavorable traits to outgroups 
in an all-or-none fashion. The outgroups judged in 


e.g., Jew, 


seven-step scales (e.g., 
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this study were Jew, Homosexual, Atheist, Murderer, 
Prostitt te, and Lunatic. In addition to these socially 
controversial concepts, six noncontroversial concepts 
were used (Waztiress, Parrot, Postman, Cat, Fisherman, 
and Dog) to assess any differences in scale-checking be- 
havior as a function of concept controversiality. Ac- 
cordingly, it should follow that Ss scoring high on an 
authoritarian scale would have a greater tendency to 
make extreme judgments than low-scoring Ss and this 
tendency should be greater with respect to contro- 
versial social concepts than with respect to noncon 
troversial concepts. 

A measure of extremity of judgment on the semantic 
differential was obtained for each S by calculating the 
mean absolute deviation from the midpoint of the 
seven-point scales. These values were summed first 
over the 20 scales on each concept, then averaged over 
all six controversial concepts. The values range from 
00 to 3.00, a score of 3.00 indicating an invariable use 
of the extreme positions. A separate mean deviation 
score was obtained over the six noncontroversial con- 
cepts. 

Subjects 
The 315 Ss used in this investigation were selected 


from the undergraduate courses in introductory psy 
chology at the State University of Iowa 


Procedure 


The administration was divided into two phases. In 


the first phase, all Ss were given the semantic differen- 


tial. In addition, 62 of these Ss were given Forms 40 
and 45 of the F scale, 63 Ss the Bass reversed scale 
(Scale B), and another 63 Ss the Jackson-Messick re- 
versed scale (Scale ( 

The second phase, about one week later, involved 
the administration of the F scale, Scale B, and Scale C 
to all Ss with order of presentation counterbalanced 

Thus, at the completion of the second experimental 
phase, all Ss had been given a twelve-concept form of 
the semantic differential, the F scale, Scale B, and Scale 
C. In addition, test-retest data were available on the 
three authoritarian scales 


RESULTS 


The product-moment correlations between 
the three F scale versions and between each F 
scale and the criterion are presented in Table 
1. It can be seen that the two correlations 
between the reversed F scales and the F scale 
are both significantly greater than zero but in 
opposing directions. Furthermore, these corre- 


different from each 
and the relationship 


lations are significant] 
1.99. 4 < 0 


between the two reversed scales (r 


» 
other (é 5 
35) does 
not approach the reliabilities of either scale 
61, Scale C 59). Thus there is 
little support for assuming the reversed scales 
to be equivalent to the F scale in content or to 
be equivale nt to each other. 


(Scale B 


TABLE 1 
INTERCORRELATIONS BETWEEN SCORES ON THE F 
Scate (F), Bass Reversep F Scare (B), 
Jacxson-Messick REVERSED F SCALE 
C), AND EXTREMITY OF ATTITUDE 
RATINGS ON CONTROVERSIAL 
Concepts (E) 

(N = 315) 


Scale F B ¢ E 
F (.67 —" —.14* a 
B (.61) 35° 09 
( (.59) — .04 


Note.—Parentheses indicate test-retest reliabilities 
° 


TABLE 2 
MEAN EXTREMITY RATINGS ON THE CONTROVERSIAL 
AND NONCONTROVERSIAL CONCEPTS OF HIGH, 
Mepium, AND Low F ScALe SCORERS 


Contro- Noncontro- 
F scale scores versial versial 
—_— concepts concepts 
Cut-off points N Mean Mean 
Upper >4.09 57 | 1.45 1.36 
CZ 
LU s : 
Middle 4.09-3.09 197 1.32 1.26 
60% 
Lower <3.09 61 1.16 1.14 
0G 
V0 
Over-all 315 1.32° 1.27* 
® These over-all means differed at the .001 level of significance 
CR = 3.13 


The relationships between the F scales and 
scale-checking behavior failed to provide any 
evidence that the reversed scales are measuring 
authoritarianism. However, the original F 
scale did correlate significantly with the cri- 
terion and the differences between the F 
scale-criterion correlation (.22) and this corre- 
lation for Scale B (.09) and Scale C (—.04) 
were both significant at the .05 level. 

The relative ratings on controversial and 
noncontroversial concepts of high and low 
scorers on the F scale (Table 2) were evaluated 
by cutting the distribution of F scale scores 
at the 20th percentile and at the 80th percen- 
tile, providing comparisons of high scorers 
(upper 20%), medium scorers (middle 60%), 
and low scorers (bottom 20%). The difference 
between mean rating extremity scores for the 
two types of concepts was statistically signifi- 
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cant for both the upper 20% (¢ = 2.25, df = 
56, p < .05) and the middle 60% (¢ = 3.00, 


df 196, p < .01) subgroups. The extremity 
scores for the lower 20% were approximately 
the same for the two groups of concepts. Thus, 
the high and medium F scale scorers tended to 
make more extreme judgments with respect to 
controversial concepts than with respect to 
noncontroversial concepts while low F scale 
scorers did not make differential judgments on 
the two types of concepts. Further, it was 
found that the upper 20% of Ss on the F 
scale showed significantly higher mean scores 
on both the controversial concepts (¢ = 3.63, 
df = 116, p 01) and noncontroversial 
concepts (¢ = 2.44, df = 116, p < .02), than 
did the lower 20% 
DISCUSSION 

The findings of this study offer evidence 
that the Bass and Jackson-Messick reversed 
F scales are not equivalent measures in that 
their correlation (r 35) falls considerably 
short of their restricted reliabilities and since 
they enter into differing relationships with the 
original F 18 and —.14, respec- 
tively). There was also no evidence to support 


scale (r 


the notion that either reversed scale is a valid 
measure of authoritarian tendencies since 
neither correlated significantly with the ex- 
ternal criterion of exiremity of social judg- 
ments. The findings of nonequivalence of re- 
versed scales does not answer which one of 
the two (or perhaps neither) represents an 
adequate reversal, since the extent of the 
original F scale-reversed F scale correlations 
is presumably confounded with acquiescence 
set and other form or content factors. It 
should encourage caution, however, in equat- 
ing research findings based upon these two re- 
versal scales at least. The failure of both re- 
versed scales to correlate with the criterion 
certainly does not lend confidence to the re- 
search utility of either. 

The F scale was found to be significantly re- 
lated to scale-checking behavior on the seman- 

differential. The relatively low correlation 
between measures (rf = .22) is 
tempered somewhat when restricting char- 
acteristics of the sample are considered. The 
college Ss employed were quite homogeneous 
in performance on both instruments (F scale, 


tnese two 
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SD = .58; controversial concepts, SD = .47 
and the mean F score (3.57) was lower than 
the presumed neutral point (4.00), indicating 
that few Ss, if any, consistently agreed with 
the items. 

In spite of the sample restrictions, the rela- 
tive performances of high and low scorers on 
the F scale were in the expected direction. The 
extremity scores of Ss in the upper 20% of the 
F scale distribution were significantly higher 
than those in the lower 20% with respect to 
both controversial and noncontroversial con- 
cepts, indicating that high scorers tended to 
judge both types of concepts more extremely 
than low scorers. The extremity scores for low 
scorers did not differ for these two types of 
concepts, while both high and medium | 
scale scorers obtained significantly higher 
extremity scores on the controversial social 
concepts than on the noncontroversial con- 
cepts. Thus, it may be concluded that in com- 
parison with medium and high scoring Ss, low 
scorers have a tendency to avoid extreme judg- 
ments on both types of concepts. These find- 
ings are consistent with descriptions of the 
nonauthoritarian typically found in the litera- 
ture. For example, Adorno (Adorno et al., 
1950, p. 778) points out that the easy-going 
low F scale scorer is characterized by an ex- 
treme reluctance to make decisions or to com- 
mit himself. 


SUMMARY 

Three versions of the F scale (original F 
scale and two reversed scales) were given to 
315 college students along with a form of Os- 
good’s (Osgood, Suci, & Tannenbaum, 1957 
semantic differential. The semantic differen- 
tial consisted of six socially controversial and 
six noncontroversial concepts which wert 
judged against 20 evaluative scales. On the 
basis of postulated characteristics of the au- 
thoritarian personality, it was expected that 
Ss scoring high on an authoritarian scale would 
have a greater tendency to make extreme 
judgments than low-scoring Ss and that this 
tendency would be greater with respect to 
controversial social concepts than with respect 
to noncontroversial concepts. 

The results indicated that F 
were positively related to the tendency to 
make extreme judgments on the semantic 


scale scores 
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differential in the manner hypothesized. On the 
other hand, no relationship was found between 
either of the reversed scales and the semantic 
differential. In addition, since the correlation 
between the reversed scales was considerably 
less than their reliabilities, it was concluded 
that these scales were not equivalent forms 
even though both were worded and scored in 
the opposite direction from the original F 
scale. The utility of these scales was discussed 
in the light of these findings. 
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CASE REPORTS 


BRIEF SUCCESSFUL PSYCHOTHERAPY OF A CHRONIC PHOBIC 
REACTION! 
C. SCOTT MOSS 
State Hospital No. 1, Fulton, Missouri 


, / “is is a report of the brief successful treat- 
ment of a phobic reaction in which the pa- 
tient’s repressive defenses were penetrated, 
leading to a simple, convincing demonstration of 
the childhood learning of the phobic response and 
its subsequent ramifications. A symbolic fantasy 
procedure, tailored to the patient’s unique per- 
sonality dynamics, was then instituted in order to 
consolidate the obtained insights and further 
desensitize the patient to the fear-provoking 
stimuli. Primary reliance was placed on several 
innovations in hypnotherapeutic technique. 


TREATMENT PROCEDURE? 

The patient, aged 45, an art instructor, stated 
that she had recently seen the movie, ““The Three 
Faces of Eve,” and it finally had encouraged her 
to seek treatment for a life-long, intense fear of 
dogs. She was also motivated by the desire to 
present her daughter with a puppy on the occa- 
sion of her ninth birthday, two weeks hence. She 
requested that, as in the movie, hypnosis be used 
to uncover the source of her phobia. The patient 
made a generally favorable initial prognostic 
impression, seeming intelligent, sensitive, and 
flexible. The problem seemed relatively circum- 
scribed, the integrating capacity of the ego intact, 
and her current environmental adjustment ade- 
quate. Her response to initial induction procedures 
was favorable, and it was agreed that treatment 
would be undertaken for a maximum of seven to 
eight sessions over a three-week period. The 
remainder of the session was spent in training her 
in rapid entry into the termination of hypnosis. 
At the conclusion of the hour, she asked that the 
attempt be made first to recapture the lost memory 
of “something important” that had occurred 
during the birth of a child ten years ago. 

At the beginning of the second session, addi- 


1 Appreciation should be extended to my graduate 
class in Psychotherapeutic Theory and Methods at the 
University of Missouri for their stimulating suggestions 
in discussion of this case: Phyllis Berger and Pa- 
tricia Shallenberger, and James Hart, Fred Stern, 
Lynn Ourth, and Lyman Riegel 

? The entire therapy sequence was recorded and is 
available in transcript form for teaching and training 
purposes through arrangement with the author 


tional training was instituted to insure her coopera- 
tion with hypnotic procedures. Following her 
wish, she was regressed to the delivery scene in 
1948. It was revealed that her baby had died 
during delivery (as had two earlier children), and 
she believed that she had forgotten some vital 
fact concerning its death. However, while she 
recalled many long-forgotten details, she was 
unable to recapture the specific memory, though 
she narrowed it down to something the physician 
told the nurse immediately following delivery. 
The following posthypnotic suggestion was given: 
that the sought-after memory would gradually 
enter awareness and that if she sincerely wanted 
to remember it, she could probably do so at the 
next session. 

The patient reported at the third session that 
the intervening period had been fraught with 
anxiety because she had found herself preoccupied 
with memories from the era in question. She also 
believed that she had recaptured the memory for 
which she searched. It was the physician’s remark, 
“The baby is dead; if we wait, she may go, too.” 
Hypnosis was induced and the delivery scene 
revivified. She immediately “heard”’ the physician 
remark, “Its neck is broken.’”’ The patient identi 
fied this statement as the elusive verbalization 
She remarked upon awakening that one immediate 
benefit of these sessions was that for the first time 
she and her husband had been able to talk freely 
about this painful period in their lives. At the con 
clusion of the session, she was given the post 
hypnotic instruction that the meaning of her life- 
long fear of dogs would begin to manifest itself 
between sessions It was further suggested that 
if these memories, thoughts, or feelings were too 
disturbing, they would be communicated to her 
in night dreams. 

In the fourth session the patient reported the 
following dream: 

I kept my dreams. I did not dream Friday 
night, and I did not sleep much Saturday night 
because Jane [her daughter] was ill, but I did 
have a dream. Do you want me to read it t 
you? This was Saturday night. It was about two 
little girls, toddlers. It was a very disturbing 
dream, incidentally. One child was playing 
with the other as if she were a doll. She was 
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carrying her, putting her face next to bers, but 
the one that was being carried around was dead. 
I went to sleep worrying about Jane, so that 
might account for that. It seems as if the live 
one was my little girl, and I was distressed when 
she picked up the dead one because she did not 
realize she was dead. I asked the undertaker 
what sort of embalming fluid he used and 
whether it would hurt my little girl. My concern 
seemed to be for her. Then there was something 
about redressing the dead baby, and I was 
concerned about this. The undertaker or some- 
body was dressing it on a bed and I was very 
sad because it was so still, and then it seemed 
as if I was trying to get a box of a specific size, 
but I could not remember what the size was. I 
guess I needed the big box to put the little ones 
in. It seemed like I needed them for clothes or 
toys or something. And then it seemed as if I 
were at my sewing machine mending a badly 
torn dress. Suddenly I realized my child was 
dead and I cried a lot in my dream. Then it 
seemed there was only one child, the dead one, 
and it was mine. It was a very sad and de 
pressing dream. I never had a dream like that 
before. It was horrible even when I thought of 
it in the daytime. And I didn’t have anything 
to drink before I went to bed 


She also volunteered that, “Oddly enough when 
I woke from that horrible dream, I recalled my 
little sister’s death, and I haven’t thought of that 
for many, many years. I recall practically nothing 
about it. It is so long ago I just never think about 
that....I1t must have been about 41 years ago, 
I was about four.” Other associations were not 
immediately forthcoming, so hypnosis was in- 
luced, and it was suggested that she allow the 
full meaning of the dream to emerge. 

For a ten-minute period, the patient sat in silent 
contemplation and then, at the prompting of the 
therapist, began to share a flood of memories which 
she apparently had repressed since childhood. The 
essence of the recovered content was that at age 
four, she had been playing in the backyard of 


r home with her younger sister when tl 


the e 
family dog, Rover, knocked the little girl down, 
and she suffered a splinter wound in her cheek. 
Several days later, the little girl died, apparently 
of an infection, and the mother openly blamed the 
patient, accusing her of knocking her sister down. 

[he patient recalled the day of the funeral in 

ne detail, particularly an incident which added 


th 


he traumatic impact of the whole situation. 


Pt . Then I go back in [the parlor] by myself. 
She looks so pretty lying there, so I think I will pick 
her up. She does not hug me like she usually does, 


though. I am holding her, and the dog is with me. He 
wants to see her, too. Oh, he tore her dress! 

T: Pardon? 

Pt: The dog, he tore her dress! 

T: How did he do that? 

Pt: He was jumping up on her and tore her dress, 
and Mother will not like that. The dog should not be 
in the house, but I do not think I let him in. I know I 
should put her down but I want to hold her. 

T: What do you think and feel about her? 

Pt: She is so pretty and so sweet. I just want to 

+ hold her 

T: What happens next? 

Pt: My mother comes into the room. She is mad at 
me. She told me I should know better. Somebody took 
the baby away from me 

T: Did you want to give her up? 

Pt: No, I wanted to hold her. It must be my Aunt 
Anna. She said I should not have torn her dress, but 
I didn’t tear her dress. 

T: [Simulating the Mother] Somebody did! Who 
did it? 

Pt: I didn’t do it. Rover did it; I didn’t. He didn’t 
mean to, he just wanted to see her. Why doesn’t that 
place on her face go away? 

Within a few days after the sister was buried, the 
patient’s attitude toward her little dog changed 
markedly. 

Pt: Now it seems like it is some days later. I am 
on the back porch by myself. I wish she would come 
play with me. Mother comes out the back door, and 
she finds me crying. I don’t want her to see me crying. 

T: You don’t want her to know how you feel? 

Pt: No. She thinks I hurt her, but I didn’t. 

T: Your mother blames you for what happened to 
your sister? 

Pt: She thinks I pushed her over, but I didn’t. I 
don’t seem to like my dog anymore. 

T: He still wants to be friendly, doesn’t he? 

Pt: I don’t want to be friends. 

T: You don’t like your little dog? 

Pt: No, I don’t like my little dog anymore. I want 
my sister. 

[: Your little dog took your sister away from you? 

Pt: I don’t like him. I wish that they would take 
nim away. 

T: You want him to go away and your little sister 
to come back? 


Pt: Yes 


Prior to awakening the patient was given in- 
structions to the effect that she was free to re- 
member as much of this experience as she was 
prepared to accept. Upon awakening, she obviously 
recalled the entire episode. 


T: Did you consciously recall your little sister’s 
death? 

Pt: I remembered very little about it. I remember 
she died. I heard them talk about it. The doctor in this 
little town didn’t know what was the matter. They 
removed a splinter from her face. He thought it was 
an infection perhaps, but I didn’t know what had 
happened. 








268 CASE REPORTS 


I: Did you ever suspect that your mother had 
blamed you? 

Pt: Oh, no, I didn’t know Mother had blamed me 
I can understand, though, some of my actions as a 
child. I always wanted to please everybody. I can 
remember that at school I would take blame for things 
that other kids did, and my mother would get real 
provoked with me because every week she had to go 
to the principal. If the teacher wanted to blame some 
body, I was always glad to take the blame. As I grew 
older, I realized it was real stupid, but I guess I didn’t 
want to be blamed for anything. I mean I didn’t want 
anybody to be blamed for anything, so I took the 
blame. I was mixed up, I know that. I wanted people 
to like me as a child. I expect that Mother must have 
been terribly upset about me picking up the baby. 
Of course, if she believed I had caused her to fall, she 
would have blamed me 


At the conclusion of the session, a posthypnotic 
suggestion was given that in the interim, the mean- 
ing of this experience and its total effect upon her 
life, past and present, would begin to make itself 
evident to her. The patient also expressed an inten- 
tion to write her mother and discuss the event that 
therapy had uncovered. 

At the beginning of the fifth session, the patient 
volunteered, “If there is any question about our 
making progress, we are.”’ She stated that for the 
first time in her (adult) life, she had petted a dog 
She also reported a reassuring dream in which she 
was attacked by a large, vicious dog but managed 
to calm him 

Jn the three days between sessions, the patient 
had experienced the compuisive recall of a series 
of memories beginning in the recent past and ex- 
tending back to early childhood. In the waking 
state and under hypnosis, these experiences were 
explored in detail. Through them ran a theme of 
feeling misunderstood and mistreated by her 
family, strong resentment and associated guilt, 
and a continuous struggle for independence and 
self-realization. 

One memory concerned an event at age eight 
involving a puppy her parents gave her in hopes 
she would overcome her fear. She reported in 
detail an episode in which she feared the dog might 
attack another younger sister, and how imme- 
diately afterward, it had a fit and died. This 
experience apparently reinforced the potency of 
the origial trauma. Careful exploration revealed 
that the patient seemed to have no conscious 
knowledge of the reason underlying her intense 
fear of dogs, even at this age. The patient subse- 
quently was regressed to successive age levels, but 
no evidence whatsoever could be found to indicate 
that she had ever experienced a physical attack 
by a dog which would account for her fear. This 
indirectly testified to the validity of the early 
experience recalled in the previous session in 


establishing her phobia. The patient, incidentally, 
had been able to verify many of her recently 
recovered memories pertaining to the past twenty 
years by reference to a meticulous, daily family 
diary and file of all letters they had ever exchanged, 
kept by her husband. A posthypnotic suggestion 
again was given that the full meaning and effect 
of this early experience would continue to manifest 
itself. 

In the sixth hour, the patient appeared more 
relaxed than at any time since the first session. 
She verbalized a number of new found insights; 
for example, she now understood why she had 
always had the need to be liked and the fear of 
offending others, the basis for her deep seated 
ambivalence toward her mother, and the reason 
underlying her discomfort in disciplining her 
daughter. Another insight pertained to her revul 
sion at touching dogs: the lively or frisky feeling 
of the dog brought back, through contrast, the 
iimp feeling of her little sister experienced at the 
time of her funeral. 

During the session the technique of age regres- 
sion was again employed, and the period between 
her early traumatic experience and the one at age 
eight, uncovered in the previous session, was 
explored. So far as could be determined, conscious 
memory of the earlier event ceased to exist even 
prior to age five. To the end of determining what 
there had been in the movie, “Three Faces of 
Eve,” that finally had prompted her to seek pro- 
fessional help, the patient was instructed to project 
on an imagined movie screen the scenes from the 
movie which had, consciously and unconsciously, 
the greatest personal meaning for her (Moss, 
1957; 1958). A fairly strong identification with 
the movie heroine was revealed, based on the 
similarity of childhood experiences relative to 
death and the resulting feeling that no one, parents 
or spouse, seemed to understand the consequent 
emotional disturbance. The movie also increased 
her optimism that she might be helped, especially 
through hypnosis. 

A major therapeutic tactic designed to facilitate 
the relearning process was introduced near the 
conclusion of the session. The patient was given 
the posthypnotic suggestion that between sessions, 
whenever she was alone and desired to re-estab- 
lish her acquaintance with Rover, the dog of her 
early childhood, she could experience a positive 
visual hallucination of him. It was emphasized 
that the dog symbolized her disturbing history, 
and this fantasy relationship provided her with 
the opportunity to make peace with her past. It 
was further emphasized that she had complete 
control of the relationship and could “allow the 
dog as close as [she] desired or keep him as distant 
as [her] fear dictated.” 
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In the seventh session the patient reported that 
she had written to her mother concerning the 
recovered disturbing memories and that although 
the mother had confirmed the existence of a small, 
black dog, she insisted that the patient’s fear was 
the result of prenatal conditioning. This seemed 
quite inconsistent to the patient since, from other 
sources, she knew that they had had several family 
dogs prior to the time when she was four and had 
never had another except for the puppy when 
she was eight. 

The patient reported increased feelings of com- 
fort in her general relationships with dogs. She had 
been able to experience Rover’s hallucinated pres- 
ence quite vividly and also had succeeded to some 
extent in making friends with him. She laughingly 
referred to him as “my little black Harvey.” At 
this juncture, the patient agreed that sufficient 
progress had occurred to warrant a month’s vaca- 
tion from therapy; however, she requested that 
she be allowed to keep Rover in order to renew 
further their acquaintance. She reported that her 
feelings toward him were no longer those of fear or 
even sadness, but instead “like two old friends who 
have quarreled and don’t quite know how to make 
up.” 

She was given the posthypnotic suggestion that 
she continue to work through her problem in the 
following form 

Of course, it will not be possible for you to 
keep Rover indefinitely. As you pointed out 
today, it would be so nice to have Rover really 
there. I think you can have Rover really there 
in a sense that down through the years Rover 
has always been there in every dog that you 
met, particularly in little, frisky dogs. Each 
dog has been Rover as you looked at him with 
your little-girl eyes. If you work hard at becom- 
ing friends with Rover, your fear will leave you 
and you will quite naturally develop a greater 
acceptance of all dogs As time goes on you will 
develop the idea of having a real Rover for you 
to enjoy and especially for Jane, who needs and 
wants a Rover of her own. 

A relatively brief session was held one month 
later. The patient reported that five days after 
the previous session she had gone skating with her 
daughter and met several dogs. “It seemed very 
natural to reach down and pet them. There was no 
fear of doing it. They did not feel like dogs used to 
feel. I heard Jane yell, ‘Oh, look, Mommy used 
to be afraid of dogs and isn’t anymore,’ and then 
it dawned on me that I really wasn’t. It seemed 
that right after that I lost Rover.” Laughingly 
she inquired, “You don’t have him around here 
anymore, do you?” She stated that the family 
was now actively looking for a dog. She also pre- 
sented the therapist with an unsolicited fifty-nine 


page narration concerning her daily feelings and 
reactions during treatment. Three weeks later, the 
patient phoned to report that she had purchased a 
black cocker spaniel puppy and had named it 
Happy Siesta. 

Almost one year to the day later, a follow-up 
interview was held with the patient. She began 
the session with the question, “Suppose the event 
I recalled wasn’t true: could the mind make up 
something like that?” Discussion revealed other 
evidence of some residual resistance against a com- 
plete emotional acceptance of her childhood 
trauma; for instance, some elements of the experi- 
ence wére now incompletely remembered, and she 
described the memory as “not quite real.” She 
also reported that while her mother continued to 
deny that the event had ever happened, a brother 
had confirmed the existence of a family dog named 
Rover and also the fact that she and her sister 
had been left momentarily alone at the time of the 
fatal accident because the two brothers, who had 
been carrying in wood, had gone to witness a fire 
at a neighbor’s. Her younger sister also recalled 
that the patient had anxiously protected her in 
the presence of any and all dogs.* 

It was explained to the patient that it was rela- 
tively unimportant whether the memory of her 
childhood experience was correct in every detail, 
that the important factor was the effect of the 
therapeutic experience on her present adjustment. 
She reaffirmed that therapy had somewhat altered 
her relationship with her mother, daughter, and 
husband. The most evident effect of psychother- 
apy, however, continued to be a marked change in 
the patient’s attitude toward dogs. “All my life 
the touch of one gave me a feeling of something 
horrible. I’m still surprised that I feel so differ- 
ently now.” 


DISCUSSION 


This case provides a clear demonstration of the 
use of a psychotherapeutic procedure in modifica- 
tion of a long standing learning process. Adjunc- 
tive hypnotic techniques were of especial value in 
rapidly uncovering and revivifying the childhood 
pathogenic experience, allowing the adult ego to 
grapple with it in a new attempt at mastery. 


3 There was considerable circumstantial evidence to 
support the occurrence of a traumatic experience of the 
nature reported by the patient. Most convincing, 
possibly, is a congruence to the known dynamic struc- 
ture basic to many phobic reactions; that is, the phobia 
dates from a fear-provoking episode in childhood, 
usually involving a severe disturbance in the parent- 
child relationship. Associated anxiety, shame, and 
guilt result in suppression and eventual repression of 
the frightening experience, leaving only the residual 
phobic symptom 
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Relearning was further facilitated by reinstate- 
ment on a symbolic fantasy level of the patient’s 
relationship with the original object of her phobia 
and with sufficient vividness to make it a realistic 
and immediate emotional experience. Hypnosis 
was also of value in reinforcing the patient’s moti- 
vational level and at times in providing an almost 
compulsive re-experiencing of long-forgotten 
memories. In this manner, substantial progress 
was facilitated between sessions and following the 
termination of formal sessions, apparently sig- 
nificantly shortening the period of actual treat- 
ment. Success in this direct relearning procedure 
was naturally dependent on the assessed capacity 
of the patient’s ego to assimilate repressed infan- 
tile experiences into conscious awareness. 

The foundation for the patient’s neurosis appears 
to have been a lack of understanding, acceptance, 
and emotional warmth in relation to her parents, 
particularly the mother. The traumatic incident in 
which she was unjustly accused of her sister’s death 
focused her feelings of being mistreated, misunder- 
stood, and rejected onto dogs. Specifically, the 
physical sensation of her sister’s cold, still body 
became associated with its opposite, the warmth 
and liveliness of the real culprit, her dog. In a sense, 
dogs thus became an idiosyncratic symbol of the 
disturbed relationship with her mother. 

In spite of her use of the dog as an object for 
projection and her protestations of innocence, 
continued accusation and rejection by her mother 
seem to have generated some guilt in the patient. 
This guilt appeared in her eagerness for punish- 
ment as a child, which may be variously inter- 
preted as direct atonement, an assertion of inno 
cence by being unjustly punished (a repetition of 
the circumstances surrounding her sister’s death), 
a technique for forcing her mother to support her 
innocence by having continuously to get her out 
of trouble, or all of these. Some feelings of guilt 
were also apparent in the deaths of her children, 
and it appeared very important for her to prove to 
her husband that she was not to blame 


One important consequence seems to have been 
the patient’s inability to express directly her feel- 
ings of hostility and resentment. A hypothesis 
that had to be considered during treatment was 
that she originally had projected these feelings 
onto her dog and that her phobia protected against 
their expression. Her desire to present her daughter 
with a puppy (dependent on eradication of the 
“protective” symptom) might then have negative 
implications. Fortunately, the year witnessed 
some improvement in the patient’s relationship 
with her daughter with no evidence of an increased 
display of hostility. 

The approach employed encouraged the patient 
to re-evaluate the influence of her childhood 
experiences in terms of present day understanding 
and to effect a fuller readjustment, not only to 
the original conflict, but to its ramifications in her 
present day life. No claim is made for achievement 
of a complete cure in the sense that the patient 
achieved full readjustment to all aspects of the 
widely generalized effect of the original conflict. 
Neither should the impression be conveyed that 
hypnotherapeutic techniques afforded a panacea 
per se. The patient described her youth as char- 
acterized by “a fear of people, new situations, 
many things,” and her marriage as having had a 
highly therapeutic effect. The death of a beloved 
older sister when the patient was aged 27 helped 
the patient partially to overcome her fear of death. 
Factors such as these contributed to a substantial 
degree of “spontaneous recovery”’ and established a 
“readiness” to circumvent the repressive defense 
upon which the hypnotherapeutic procedure then 
capitalized. 
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AN INTERVIEW WITH A MUTE CATATONIC 


JOSEPH LYONS 


Veterans Administration Hospital, Lexington, Kentucky 


F THE classic symptoms of catatonia, mute- 
O ness captures the interest of the clinician to 
a greater extent than does immobility or even 
flexibilitas cerea. Whether this is due to a recogni- 
tion of the importance of language as a medium 
for social relations, or simply because of the high 
that members of a “talking” profession 
place on speech, is not easy to say. In any case, a 
situation in which a mute patient is persuaded to 
talk promises to be both dramatic and revealing 
In addition, as the following interview shows, it 
has much to teach us about the world of the schizo- 
phrenic, even when the patient offers little that 
seems to be explicitly informative or introspective. 

rhe patient, Evan Lear, aged 29, was admitted 
to the hospital in a catatonic episode of acute, 
sudden onset. Seen in Admission Staff conference 
m the same day, he was mute and somewhat di- 
sheveled, responding passively when led into the 
room but resuming his tense, rigid attitude when 
seated. During the interview his facial expression 
varied from a flat, silly smile to obvious fear 
Examined a few hours later under thiopental 
sodium (pentothal, Abbott), he said that he did 
not talk because “one person should not do all the 
talking’”’ and because he did not want to “tell 
tales out of school.’’ Although he was evasive and 
noncommittal, there was some evidence of para 

id thinking. 

The next day he was given amobarbital (amytal 
and while under the influence of 


lie 
Vaiut 


sodium, Lilly), 
the drug he ate voraciously and commented spon- 
taneously about how hungry he was. During the 

lowing week, there was an unsuccessful attempt 
to administer psychological tests. In a second 
attempt, using amobarbital, he produced a Ror 
schach protocol and some drawings. These were 
interpreted as showing a long-standing schizo- 
phrenic condition with paranoid features in a man 

average intelligence. The Rorschach responses 
were almost all Wholes, with good form, no color, 
nd only one Human percept. On Card VII he 
saw “a woman broken in two,” and to this he 
“Looks like they’ve 


commented in the inquiry, 
lrawed a little too close together.’ 

At the Medical Staff two weeks after his admis 
ion, it was reported that although he had been 
feeding himself with a spoon, he remained mute 
on the ward. He usually displayed a silly smile but 
rarely looked around him and had no interpersonal 
Occasionally he other symp- 
waxy flexibility to a mild degree, manner- 


contacts showed 


toms 


istic moving of his hands, and rapid blinking of his 
eyes. Except for a few impulsive outbursts of 
aggression, he remained essentially unchanged for 
the next month, while permission to begin electric 
shock treatment was sought from his father. 
Meanwhile he had been studied intensively, by 
tape recording and motion picture 
photography, in the Research Laboratory of the 
hospital. Approximately nine weeks after his ad- 
mission, the interview which is reproduced here 
was obtained as part of a continuing study of cata- 
tonic expression. Two psychiatrists and a research 
psychologist (identified in the text, respectively, 
as Drs. A, B, and C) collaborated in an attempt to 
have the patient reveal his experiences in the 


means of 


stuporous condition.! 


INTERVIEW 


Dr. A.: We hope we can help you. —To understand 
you better, we have given you some drugs which will 
make it easier for you to talk. Perhaps then you can 
tell us something about yourself, and about the way 
you look at the world. —That isn’t painful, huh? 

Do you have a little bit of a strange taste? —You 
have been with us now for quite a long time, and in 
all these days you haven’t told us much. Do you think 
you could do it today? —If you don’t want to talk, 
you could nod. Could you do that much? —You may 
get tired. Do you feel tired? Huh? Do you feel tired? 
Couldn’t you give me a sign, just a yes or no? —Feeling 
tired? Huh? How is it? Feeling sleepy? —Say, do you 
know that this is a hospital here where we are? Do you 
know that? That we are doctors? Do you know that? 
This hospital is at Lexington, Kentucky. Do you know 
that? You haven’t given me any sign whether you 
know it or not. That is not much to say, yes or no. 
Huh? Are you drowsy? Say, do you feel drowsy, 

1 For the interview, the patient was prepared with 
an intravenous injection of 3.75 grains of amobarbital. 
The entire session, which lasted approximately 50 
min., was tape recorded. It is given here with no 
editing except for necessary punctuation; the reader 
will recognize that pseudonyms have been used fcr the 
real names of persons and places. 

It should be kept in mind that the interview actually 
proceeded quite slowly and that the impression which 
may be gained in reading, that the interviewers were 
pressing or badgering the patient, results from the 
necessary compression of the spoken material on the 
printed page. The dashes are meant to represent 
pauses; however, no effort has been made to show 
pauses between speakers. Actions by the patient are 
given in parentheses; the bracketed and indented 
portions are comments by the writer. 
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sleepy? —Would you like to lie down? Would you like 


to lie down? 

Dr. B.: We know you can talk. Suppose you tell us 
your name. —What is your name? Huh? What is your 
name? —Is your name Smith? 

Dr. A.: Who are you? Now tell us, who are you? 
—Then what are you? Okay, who are you? What are 
you? Tell us, please. Are you one of us, or are you 
somewhat different? —Getting sleepy, huh? Getting 
sleepy? Keep awake, can you? Can you keep awake, 
buh? 

Dr. B.: Supposing you tell us your name. What is 
your name, huh? What is your name? 

Dr. A.: Could you be good enough and tell us who 
you are? 

Dr. B.: What is your name? Come on, now, teil us 
Carl? John? Tell us your name—what is it? 

Dr. A.: Do you hear us? Don’t be so proud you 
cannot give a sign. Give us a sign whether you hear 
us or not, huh? Huh? Hello. —You would rather go to 
sleep than to speak, huh? Can you open your eyes? 

Dr. B.: Open your eyes. Look up here. Come on, 
look up here. Look up, now 

Dr. A.: Do you want some food, something to eat? 

Who are you? Who are you? Come, let us know. 
What are you? 

Dr. B.: Look up here. Who are you? Who are you? 
Open your eyes, now, and look up here. Come on. 
Open your eyes—wider, wider. Open them real wide. 
Now look up here. Look up. Open your eyes and look 
up here. Look up, now. Who are you? Tell us who 
you are. Who are you? 

Dr. A.: Perhaps you could write it down if you had 
@ paper and pencil 

Dr. B.: Who are you? What’s your name? What is 
your name? What’s your name? What’s your name? 
Tell us who you are. Tell us who you are. Look up, 
now, open your eyes. Open—open your eyes and look 
up. Who are you? (Patient is given paper and pencil, 
which he takes.) 

Dr. A.: Would you write it down? Would you write 
down who you are? I will hold it for you. And what 
are you? What are you? Write it down. (Patient writes 
his full name.) 

Dr. A.: That says who you are, but we would also 
like to know what you are. Is that all you can do? 

Dr. B.: What does that say? What does this say? 

Dr. A.: Could you write down how old you are. 
Patient writes some numbers.) Thirty-one years old? 

Dr. B.: What is that? Serial number. 

Dr. A.: Yes 

Dr. B.: Was that Army or Navy? Was that Army 
or Navy? —Is your name Evan? Is your name Evan? 
Is that your first name? Is it Evan? Evan? Is that 
right? (Patient nods.) It is Evan. And were you in the 
Army or Navy? 

Dr. A.: Write it down. (Patient writes “Army.’’) 
Dr. B.: Now can you tell me what that says. 
[Of interest in the interview thus far is the eagerness 
of the doctors to have the patient say something, 
no matter what it is. Now that he is about to utter 
his first words in months, one almost expects that, 
like the Sphinx, he will reveal a profound secret 
that will not be an anticlimax to his profound 
silence. But notice the interchange that follows.] 


Pt.: Ar—my. 

Dr. B.: How long were you in the Army? What’s 
that? Huh? 

Pt.: Two. 

Dr. B.: Two what? Two years? Is that right, two 
years? Is that the right time you were in the Army, 
huh? 

Pt.: Uh-huh. 

Dr. B.: And where do you live? 

Dr. A.: Come, write it down. (Patient writes an 
address. ) 

Dr. B.: I can’t read that too well. Can you tell me 
what it says? 

Pt.: Care of Edgar Lear, R.F.D. 2, Esquirol, Ohio. 

Dr. B.: Is that your home? 

Pt.: Yes sir. 

Dr. B.: And how long have you been there, Evan? 

Pt.: I have been there all of my life. 

Dr. B.: And do you know where you are now? 

Pt.: Approximately. 

Dr. B.: Approximately, where? What kind of a 
place is this? 

Pt.: A V.A. hospital. 

Dr. B.: That is right. Now, do you know what state 
it is in? 

Pt.: Kentucky. 

Dr. B.: Do you know what town this is? 

Pt.: This is Lexington. 

Dr. B.: And how long have you been here now? 

Pt.: How long? 

Dr. B.: Uh-huh. 

Pt.: Two months and ten days. 

Dr. B.: Two months and ten days. Do you have 
any idea why you are in the hospital? 

[At this point the patient becomes silent, as can be 
seen from the passage that follows. In order to 
understand this, we have to consider the kind of 
question that produced the silence. Note that it 
differs from the preceding ones in that it is not a 
question of fact. The answer to it requires that the 
patient consider his present situation with some degree 
of reflective awareness, that he adopt toward himself 
an attitude different from the attitude one has toward 
the objective, public data of one’s life. This he appar- 
ently cannot or will not do. The first hint of his 
failing in this regard, in fact, was given when, to the 
first question about where he was, he changed his 
previously precise answers to the slightly evasive and 
imprecise word, ““Approximately.”’] 

Dr. A.: You have been talking so littlke—why? Is 
some one forbidding you to talk? 

Dr. B.: Do you have any idea, Evan, why you are 
here? Is there something the matter with you? Huh? 

Are you sick? Evan, are you sick? —You haven’t 
answered me. Are you sick, Evan? —We think you are 
sick. Are we wrong? Or are we right? Huh? What 
would you say? —Evan, what would you say about 
that? —How do you feel? Huh? Evan, do you feel all 
right? Come on, now, you are not that sleepy. How do 
you feel? —Huh? Do you feel all right? Evan. 

Dr. A.: Do you have any orders from God to keep 
silent? Does God order you not to talk? 

Dr. B.: Evan, look up here a minute. How do you 
feel? —How do you feel? Talk to me, how do you feel? 
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Do you feel well? Huh? Do you feel well? Does any- 
thing hurt you? Does it? 

Dr. A.: Would you like to write it down—how do 
you feel? 

Dr. B.: Evan, look up here, now. Tell us how you 
feel. Does anything hurt you? Huh? —Nothing hurts 
you. Do you feel good? Do you feel good or bad? Huh? 
—Come on, now, you know whether you feel good or 
bad. Tell us. Which is it? Evan, which is it? Huh? 
—Come on, spit the word out—which is it, good or 
bad? —Aren’t you going to tell us? Or don’t you 
know, huh? Evan— 

Dr. A.: Do you want to write it down? You could 
write “good” or “bad.” Could you? 

Dr. B.: Why don’t you look at (Dr. A.) when he 
talks to you? Come on, now, and look at him. He is 
not such a bad looking guy. Turn your head around 
here and take a look at him. Go on. Look up at him. 
Evan. 

Dr. A.: Do you have a secret which you don’t want 
to reveal? That you don’t want to reveal? Evan, do 
you have a secret? No secret. (Patient is given a drink 
of water, which he accepts.) 

Dr. B.: How are you feeling now, better? A little 
drunk? Huh? 

Dr. A.: Could you talk a little bit more? 

Dr. B.: Would you like to have another drink? 
Huh? What do you say? —You look like you are feeling 
pretty good now. Do you? Do you? Huh? —You look 
like you are enjoying this—are you? Does this make 
you feel pretty good, Evan? Evan, what do you say? 
Are you enjoying this? —I have some more where 
that came from. Would you like to have some more? 
Huh? How about it? Do you want some more? —What 
do you say, Evan? Do you want to quit it or do you 
want some more? Would you like to continue? 

Dr. A.: Do you have a brother or a sister? Have a 
brother or sister? 

Dr. B.: Evan, Dr. A. asked you a question 

Pt.: Three 

[The question that brings the patient back into the 

conversation is a straightforward question of fact.] 

Dr. B.: What? 

Dr. A.: Three sisters? 

Dr. B.: I can’t make much out of that. You will 
have to tell us. Brothers, sisters, both? 

Pt.: Two. 

Dr. B.: Two what? Two what, Evan? 

Pt.: Brothers. 

Dr. B.: Do you have any sisters? 

Pt.: One. 

Dr. A.: I think he said four 

Dr. B.: How many, Evan? 

Pt.: One sister. 

Dr. B.: Is your sister older than you are, or younger? 

Pt.: Younger. 

Dr. B.: Is she married? 

Pt.: Yes. 

Dr. B.: How about you, are you married? —-Have 
you ever been married? 

[The patient does not answer this pair of questions 

He is selective, even in regard to questions of fact.] 

Why not? Don’t you like the girls? Huh? Or haven’t 


you found one you would like to know? —Does your 
married sister have any children? 

Pt.: One. 

Dr. B.: Then that makes you an uncle, doesn’t it? 
—And how about your brothers—are they married? 
Are they younger than you or older? 

Pt.: They are younger. 

Dr. B.: Then you are the oldest in the family—is 
that right? You are the big brother? 

Pt.: Uh-huh. 

[Again, as the questioner touches on an area about 
which the patient might have some regard for its 
significance in his life history, Evan shies away. It 
should be noted, however, that it is not simply the 
biographical or “personal” questions at which he 
balks. He will answer questions about himself. The 
distinction is that he will respond only to questions 
which do not call upon him to express some measure 
of reflective understanding of his position as a 
meaningful point in his personal history. Thus, the 
result of the interview so far is that some information 
has been obtained which one might get more easily, 
and with greater confidence in its accuracy, from the 
patient’s case folder.] 

Dr. B.: And do they like their big brother? Huh? 

Pt.: I couldn’t tell you. 

[At this point, for the first time, the patient does not 
fall silent at a question of personal significance but 
simply pushes it away. We will see below that he 
has other methods at his disposal] for dealing with 
such pertinent questions. | 

Dr. B.: Oh, I bet you have some ideas about that. 
‘ou really couldn’t say? —Are your folks living, Evan? 

Pt.: Yep. 

Dr. B.: How old is your dad? 

Pt.: I figure around fifty-four. 

[This is an excellent, well-phrased answer, only one 
of many well-spoken sentences which occur through- 
out the interview. Could one guess, on the basis of 
this answer, that the patient is only a few moments 
or a few words away from one of the most striking 
of schizophrenic symptoms? Evidence of this kind 
makes it difficult to maintain that the patient is the 
helpless victim of an overwhelming emotional state or 
of a pervasive disorder of thinking. The interchange 
that follows is particularly illuminating in this 
respect.] 

Dr. B.: What is his name? 

Pt.: Edgar P. Lear. 

Dr. B.: And your mother, how old is she? 

Pt.: How old she is? 

Dr. B.: Probably about the same age your father is? 
Pt.: Close. 

Dr. B.: And what is her name? 

Pt.: Cordelia Regan. 

Dr. B.: And they live in Esquiroi, Ohio? 

Pt.: No, they have been separated. 

Dr. B.: And who have you been living with? 

Pt.: My dad. 

Dr. B.: And where does your mother live? 

Pt.: Gone to Goneril, Ohio. 

Dr. B.: Ohio? 

Pt.: Yes. 

Dr. B.: Is that very far from Esquirol? 
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Pt.: About twenty miles—twenty-five 

Dr. B.: That is not so far. Do you ever visit your 
mother, see her? Does she ever come to see you? 

Pt.: I see her every once in a while 

Dr. B.: Is she re-married again? 

Pt.: Yep, a Regan 

Dr. B.: She is married to somebody by the name of 
Regan. And who is there at home with your dad? 

Pt.: There wasn’t nobody when I left. 

Dr. B.: The others were all away? 


Pt.: Uh-huh 
Dr. B.: Just you and your dad live there together? 
Pt.: Yep 


Dr. B.: What sort of work did you do? 
Pt.: Made poultry coops 
Dr. B.: Poultry coops. Are you a carpenter—that 
sort of thing? 
Pt.: No, I’m not a good carpenter 
Dr. B.: Well, how do you make poultry coops—what 
do you use? Lumber and chicken wire, is that what 
you mean? 
Pt.: I used any kind of wood you can get hold of 
Dr. B.: Uh-huh. And do they raise a lot of chickens 
around that part of the country? 
Pt.: Quite a few 
Dr. B.: And these coops you made, you sold—they 
weren’t just for your own use? You made coops for 
others too? 
Pt.: They were for use to put poultry in 
[Clearly, this patient is capable of a kind of tart 
dignity which sometimes even makes the questioner 
seem a little foolish.] 
Dr. B.: And how was business—was it pretty good? 
Pt.: Yes, it was pretty good. 
Dr. B.: Were you working before you came to the 
hospital? 
Pt.: Yep 
Dr. B.: How long hadn’t you been working before 
you came to the hospital? 
Pt.: What was that question again? 
|Here, if anywhere in the interview, is evidence that 
the patient is in contact, even to the extent of asking 
for clarification of a question.] 
Weren't working? 
Dr. B.: Yes. Did you work before you came in, that 
is what I mean. Or had you not been working for a 
couple of weeks? Do you remember now? Huh? —When 
you came in, Evan, you weren’t talking to us. Did you 
stop talking a long time before you came to the hos 
pital? 
[At long last we come to the topic which is the real 
purpose of the interview. What will the patient say 
when he is finally confronted with the issue of his 
muteness?] 
Pt.: I never did talk too much 
Dr. B.: No, but you talked some 
Pt.: Yep, a little 
Dr. B.: And did you stop talking after you got here 
or before you came in? 
Pt.: Before I came here 
Dr. B.: Was it a long time before or just a couple of 
days? Huh? Do you remember now how long it was 
that you had stopped talking before you came here? 
Pt.: I don’t know just how long 


Dr. B.: Do you have any idea why you stopped 
talking? Huh? —What made you stop talking? Huh? 
—Did anything make you stop talking, or did you 
just decide to quit? Huh? Which was it? Evan, huh? 
—Did you just decide to quit talking or did something 
make you stop? Huh? Evan—why did you stop talking? 
Huh? —Evan, you haven’t answered my question. 
Why did you stop talking? Huh? —Aren’t you going 
to answer that question? Huh? What do you say? 

Pt.: I couldn’t say yes and I couldn’t say unh-uh 

Dr. B.: You mean you don’t know why you stopped 
talking? 

Pt.: No 

Dr. B.: Did you get some message from some place 
that wanted you to stop talking? 

Pt.: No. 

Dr. B.: Isn’t it kind of hard to get along with people 
and not talk to them? 

Pt.: Not always. 

Dr. B.: Do you think it is easier by not talking? 

Pt.: I wouldn’t say that. 

Dr. B.: Well, you have been here, you have told me, 
three months and ten days? 

Pt.: Approximately. 

Dr. B.: And you have hardly said a word. You know, 
we would like to know how you feel and what is bother- 
ing you—and maybe we can help you. Would you like 
to get well and go home? Huh? —Would you like to 
get out of the hospital, Evan? Huh? Or haven’t you 
given it any thought? Huh? 

{At this point the patient has become silent again. 
What have the interviewers achieved thus far? The 
patient has given some information by answering 
questions of fact; he has told us that one of the 
common explanations for his silence, that he re- 
ceived a “message” to this effect, is not true for him; 
on matters of opinion he remains noncommittal; and 
on questions concerning cause or significance, he 
simply does not talk at all. Thus, he leaves us in 
doubt at the very point where we most desire an 
answer—although this finding may itself be some 
thing from which we can learn.] 

Dr. A.: Do you like it here? Huh? Do you like it 
here? 

Dr. B.: You can say yes or no to that one, can’t you? 
Huh? Do you or don’t you? 

Dr. A.: Just say no if you feel that way. 

Dr. B.: Do you like it here in the hospital, Evan? 
Huh? How have they been treating you here? Have 
they been treating you all right? Huh? —What do you 
say? Has everybody been treating you all right? Huh? 

Dr. A.: Would you like to talk if you could? Huh? 

Dr. B.: Evan, why don’t you answer (Dr. A’s) ques- 
tions? Would you like to talk? Huh? —Is it hard for 
you to talk, Evan? Does it hurt you to talk, or does 
something happen to you when you talk—that you 
would rather not, huh? Is that what’s wrong? Evan, 
you know you have never told us why you don’t want 
to talk. Why is it? Huh? Are you afraid talking will 
get you into some sort of trouble? Or that you might 
tell somebody a secret you don’t want to? Huh? Evan, 
You have never told us why you 
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[It is ironic that the interviewers are still making 

these requests after the patient has indeed “talked.”’] 

Dr. A.: Do you think we are not good enough to 
talk with you—that we are not good enough? Huh? 

Dr. B.: Evan, what do you say? Would you like us 
to stop bothering you? Huh? —Are we bothering you, 
Evan? (Patient shakes his head.) 

[He may not be talking, but he is not out of contact.] 

Not bothering you. —Then why don’t you talk? 
Huh? Tell us that? Why don’t you talk? Evan. Why 
don’t you talk to us? 

Pt.: There ain’t much to say, is there? 

[This is an astounding statement. What are we to 
make of it? Is it possible that the patient has heard 
and understood the coaxing and cajoling to have 
him talk, that he is aware of how important his 
talking is to his doctors, but that he simply dis- 
agrees with them that he ought to talk? This is one 
possible explanation of his remark, but it seems 
highly improbable. On the other hand, if we accept 
his statement as describing how things appear to 
him—as I think we must—then we may have to 
make some adjustment in our views concerning his 
“conflicts.”” Rather than saying that his silence is a 
determinate result of the psychic forces produced by 
his conflicts, we might propose that it is the other 
way around; that he maintains his silence in order to 
avoid a conflict to which talking might lead; that 
the decision to keep silent precedes, rather than 
follows, the presence or absence of conflict. This 
would be consistent, too, with the evidence that he 
is unerring in his selection of the kinds of questions 
that he answers.] 

Dr. B.: Well, sometimes there is. —We could get 
along much better if you would talk to us. —Are you 
afraid of saying something that you shouldn’t? Huh? 
Evan, are you afraid that you might say something 
which you shouldn’t? —Doesn’t it seem rather strange 
to you that a person just refuses to talk for three 
months? Now, you can say good morning. —Isn’t that 
kind of peculiar, Evan? Huh? —Isn’t it? —Tell me— 
doesn’t that seem strange to you? Huh? —Doesn’t 
that seem peculiar, Evan? —What do you think about 
that? —What would you think about someone who 
didn’t talk for three months? Huh? 

Dr. A.: Well, he feels he won’t do it today, either. 

Dr. B.: Oh, sure he will. —Evan, what do you think 
about somebody who won’t even talk to anybody? 
Huh? Who wouldn’t talk for three months? Wouldn’t 
you think there was something wrong with them? Huh? 

Did you have an argument with your father before 
you came into the hospital? Huh? 

Pt.: No. 

Again the patient is perfectly “willing’”’ to re-enter 
the conversation, but only on very specific terms. 
The questions must be of the type that usually appear 
on a face sheet or an application for employment— 
questions of fact, the content and accuracy of whose 
answers could be a matter of public record. The 
result of the patient’s “decision” for muteness would 
seem to be this, that as far as others are able to know 
or discover, he is now completely a “public” creature; 
no longer a unique individual, with a private life 
which he hides or reveals, shares or keeps to himself, 


expresses or conceals, but something which only looks 

like a person but has no more life than a case file or a 

set of vital statistics.] 

Dr. 8.: You didn’t get into any words with him? 
(Patient shakes his head.) Then why are you here? 
—Do you think you should be in the hospital? Huh? 
—Evan? What do you think about it? Should you be 
in the hospital? Huh? —What do you think about 
that, Evan? —Have you ever seen this gentleman 
before? Do you know his name? —It is (Dr. C.) 

Dr. C.: We are old friends, aren’t we, Evan? Huh? 
—Do you remember playing ball over there? —You 
drew me a picture once. What did you draw, Evan? 
Huh? —Do you remember you drew a nice picture, a 
nice big picture? What was it? 

Pt.: A little pyramid. 

Dr. C.: You very carefully drew a pyramid. Then 
you drew something on top of it—what was that? 

Pt.: A little star on top of them. 

Dr. C.: You were afraid the stars were going to fall 
off, weren’t you? —Do you know what else we talked 
about over there? —You remember the bullet wound, 
don’t you? —Where were you wounded? —Where were 
you when you got this bullet wound? Were you over- 
seas? —Is it on your right foot or your left foot? —Oh, 
I remember now, it is on your right heel. —What year 
was that? 

Pt.: 195—1945. 

Dr. C.: You got out of the army right after that? 
Pt.: Uh-huh. 

Dr. C.: Were you a good soldier? —And wasn’t that 
funny question? 

Pt.: I will let somebody else be the judge. 

[Again the patient demonstrates a dignity which is 
rather admirable. Although he is committed not to be 
committed to anything as a person, some individual 
characteristics still shine through.] 

Dr. C.: Were you a Private? Pfc? 

Pt.: P-v-t. 

Dr. C.: Were you a rifleman? 

Pt.: Uh-huh 
Dr. C.: Where were you—France? Germany? 

Pt.: I was in the Ruhr pocket. 

Dr. C.: You were there, huh? How did you feel 
about getting out? Were you tickled? —That is a little 
tricky question, too. 

Pt.: That is something I don’t have much to say 
about. 

Dr. C.: Say, when you say you can’t say yes or no 
as to why you don’t talk, is that because you don’t 
know or because it is so complicated you can’t explain? 
You know, it could be either way. It could be that 
you don’t know or it could mean that it could be 
complicated or it is too personal—either way. What do 
you say about that? Huh? —Do you like ball—base- 
ball? —Who won the World Series? —Do you remember 
the World Series? I don’t know whether you were here. 
How about it? How far did you go to school? 

Pt.: I didn’t quite finish the second year in high 
school. 

Dr. C.: How old were you, do you remember? 

Pt.: Nineteen. 

Dr. C.: You were the oldest child. Why did you 


quit school? 
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Pt.: Had to go to the Army 


Dr. C.: Did you say nineteen? The second year 
high school? 

Pt.: Yep 

Dr. C.: Had you gone to schooi regularly? 


Pt.: Every year. 


Dr. C.: How far did you live from your school? 

Pt.: Six miles 

Dr. C.: Did you ride the school bus? Consolidated 
school? 


Pt.: I don’t know whether you would call it a 
consolidated school or not 

Dr. C.: Did you go on the school bus, walk, or what? 

Pt.: Rode the school bus 

Dr. C.: Do you know how many students there were 
in the school—would you guess? Was it a big school? 

Pt.: The whole school? 

Dr. C.: Yes. 

Pt:: A hundred and fifty 

Dr. C.: It wasn’t too small 
just had to quit to go to the Army? 


You liked school and 


Pt.: Uh-huh 
Dr. C.: Were you drafted? 
Pt.: Yep 


Dr. C.: This effect of the sodium amytal—it makes 
you feel a little sleepy, of course, but how about moving 
your hand? Can you move it as well as you used to? 
Let us see vour grip here. How hard can you squeeze 
my head? —Not too hard. --Can you—going back— 
can you say why you can’t talk, whether it is too 
complicated, you don’t know, or because it is so per- 
sonal you just don’t want to talk about it? 

Pt.: One should talk when you are spoken to. 

[This is the closest that the patient comes to a 

reflective comment about himself and his situation. 

Yet even at this point he can do no better than 

phrase it as a proverb, as a bit of collective or folk 

wisdom rather than as a personal view.] 

Dr. C.: Yeah—but well, that is when you are with 
people who aren’t your friends, but you have got to 
feel friendly towards people. Here we are, just here, 
we are just talking about these things. You are free to 
say anything you want to. You will find us pretty 
understanding people to talk to. —Did this come as a 
decision on your part not to—I mean, we know that 
there is nothing wrong with your mouth or your vocal 
cords, it is possible for you to talk. Now, is this a 
decision on your part not to talk? —Are you familiar 
with the old saying: what is it—‘‘you can lead a horse 
to ....” What is the rest of that? 

Pt.: “... horse to water but you can’t make him 
drink?” 

Dr. C.: What do you think of that? Huh? Think 
there is any truth in it? Huh? 

Pt.: Yeah 

Dr. C.: Okay, your patient. 

(At this point the interview was terminated. The 

patient fell asleep about five minutes later, from the 

effects of the barbiturate.) 


CONCLUSION 
Like most clinical data, this interview must 
speak for itself. For some listeners, it may be mute; 
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for everyone, it will point in the direction to which 
he is inclined. The comments that were made by 
the writer, as he leaned intrusively over the inter- 
viewers’ shoulders, are meant to suggest one way 
of understanding the strange, almost paradoxical 
phenomenon of a patient whose condition of mute- 
ness varied almost from moment to moment. The 
drug used in this interview is sometimes called 
“truth serum;” but the selectivity of this patient 
in regard to the “truth” that he reveals ought to 
serve as a caution to those who would expect 
startling results from subjects who are presumably 
in much better command of their faculties. 

It may be, of course, that the critical question 
asked of the patient—as to why he did not talk— 
was unanswerable by him for reasons which had 
little to do with his illness. Perhaps such a ques- 
tion, involving as it does an action as fundamental 
as language, posed as much difficulty for the 
patient as would the complementary question, 
asked of the man in the street, as to why he does 
talk. In any case, the retrospective analysis of 
case material as attempted here cannot lead us to 
conclusions, only to some sources of testable ques- 
tions. 

What, then, are the questions which are posed 
by the clinical data presented here? Three appear 
to stand out: 

1. Why does a powerful sedative make the 
catatonic patient talk? If his condition is one 
which is halfway between waking and sleeping, as 
is claimed by some writers, he should be put firmly 
to sleep by a sedative and awakened only by a 
stimulant. What is the nature of a catatonic stupor 
such that relaxing the patient leads him to begin 
communicating with others? 

2. If, as appears likely, the patient is selective 
in his silence, we are faced with the issue of will 
vs. impulse—that is, of whether a specific organiza- 
tion of his impulse life causes him to choose topics 
he will talk about, or whether he chooses the topics 
in order to maintain a certain state. A remarkably 
similar problem has been encountered recently in 
experimental studies of the phenomenon of sub- 
ception; whether the presumed nonconscious 
forces of the personality cause the subject to 
“recognize” and therefore to avoid certain stimuli 
before they come to awareness, or whether he 
selects the stimuli to be avoided and reports his 
choices in special ways. 

3. Now that some information is available, 
only inferentially, concerning the nature of the 
catatonic’s experience, how may we use it to frame 
new generalizations concerning the schizophrenic 
state? 

The proposal offered in this paper, supported 
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rhich by an analysis of the clinical data, is that the that the catatonic stupor is a state of vigilance in 
e by schizophrenic steps out of his own history and which his declaration of lack of commitment is 
nter undertakes to refrain from any commitment to guarded and preserved. Stated as hypotheses, 
this proposal as well as all of the other issues 
raised above, would appear to be amenable to 


igorous experimental investigation. 


way responsible action with others; that he then pre- 
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CRITIQUE AND NOTES 


A DETERMINER OF THE CENTRALITY OF A TRAIT IN IMPRESSION FORMATION 


DAVID T. BENEDETTI anp JOSEPH G. HILL! 


University of New Mexico 


SCH HAS SHOWN that the impression formed 
by an experimental subject (S), when he is 
given a list of traits descriptive of a hypothetical 
person, is not based equally upon the various 
given traits (Asch, 1946). Some traits are more 
“central” than others, i.e., they carry more weight 
in determining the impression than do “peripheral”’ 
traits. For example, warm-cold is more central 
than polite—blunt. 

In his studies Asch was not concerned with 
individual differences among his Ss. In his words, 
“‘We have not dealt in this investigation with the 
role of individual differences, of which the most 
obvious would be the effect of the subject’s own 
personal qualities on the nature of his impression”’ 
(p. 283). 

It is precisely this possibility of relationships 
among the traits of the perceiver and those of the 
perceived that prompted the present investigation. 
In passing, note that the term “perception” is used 
here in the broad sense of “apperception and cogni- 
tion,” as in Tagiuri and Petrullo (1958, p. x). 

A number of other workers have found that the 
perception of others is related to variables in the 
perceiver (e.g., Crockett & Meidinger, 1956; 
Fensterheim & Tresselt, 1953; Lindzey & Rogol- 
sky, 1950; Stagner, 1948). To our knowledge, 
however, no one has approached this relationship 
in the context of the Asch method. 

This method involves a laboratory procedure 
that appears remote from the real-life formation of 
impressions of perceivable persons. This observa- 
tion is mitigated, however, by an experiment that 
showed that warm-cold exhibited centrality in a 
situation which involved an observed, real person 
(Kelley, 1950). 

The present investigators were concerned 
specifically with the question of whether the ob- 
server’s possession of a given trait would influence 
the weight attached to that trait when perceived 
in another person. Accordingly, the hypothesis was 
erected that the centrality of a trait varies with 
the strength of that trait in the perceiver. 


METHOD 


Subjects. A total of 204 general psychology 
students at the University of New Mexico were 


1 J. G. Hill is now at the Los Alamos Scientific Lab 
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allocated to three groups of 68 as explained below. 
Ss were unaware of their classification and of the 
purpose of the experiment. 

Procedure. The trait pair sociable-unsociable 
was selected for experimental manipulation 
because a pilot study had indicated that this trait, 
which is objectively measured in the Gordon Per- 
sonai Profile? (Gordon, 1953), is high in centrality. 
The Gordon test was administered to all Ss, who 
were then divided into High, Middle, and Low on 
the basis of their Sociability scores. The lowest 
percentile in the High group was 73, and the 
highest percentile in the Low group was 37. 

Each group was subjected to the same experi- 
mental procedure, which followed Asch’s method. 
Ss were read a series of traits that described a 
hypothetical person; after the second reading they 
wrote a short sketch giving their impression of the 
person. A random half of each group, working 
under the “Sociable’’ condition, heard the follow- 
ing traits: intelligent, skillful, industrious, sociable, 
determined, practical, cautious. The other half of 
the group worked under the “Unsociable”’ condi- 
tion, hearing: intelligent, skillful, industrious, 
unsociable, determined, practical, cautious. As in 
Asch’s study, the two series were identical except 
for one varied trait, in this case sociable-unsociable 
in place of Asch’s warm-cold. 

The thumbnail sketches of the hypothetical 
person were procured only for the purpose of main- 
taining a parallel with Asch’s procedure. After the 
sketches were written the Ss were given a list, 
adapted from Asch, of 20 trait pairs, from each of 
which they selected the trait that best fitted their 
impression. This session occurred during the nine 
laboratory periods in the week following the ad- 
ministration of the Gordon test in the two large 
lecture sections; Ss were given no hint that the two 
events were related. The 20 trait pairs are given in 
Table 1, with the nonpreferred trait in each pair 
enclosed in parentheses; on the check list given to 
Ss, the preferred trait in each pair came first or 
second at random. 

The prediction. Operationally stated, our hypoth- 
esis predicted differential checking of preferred 
traits in the three groups. In terms of the analysis 

2 The following statements are illustrative of the 
Gordon items measuring sociability: “enjoys having 
lots of people around’; “a good mixer socially’; 
“finds it easy to make new acquaintances.” 
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TABLE 1 
NomsBers OF HiGH, MrppLe, AND Low SOcrIABILITY 
Supyects ATTRIBUTING PREFERRED TRAITS 
UNDER THE SOCIABLE AND UNSOCIABLE 
CONDITIONS 
(N of each subgroup = 34) 


High Middle Low 
Subjects Subjects Subjects 
Trait — 

& 4 rs) 

: 2 2 3 : 2 

A =) A ~ A ==) 

Generous (ungenerous 33 5 32 6 32 15 

Happy (unhappy 32 ¢ 31 32 22 

Good-natured (irritable)| 32 5 33 ) 31 15 

Humorous (humorless) 31 4 30 5 30 3 

Popular (unpopular) 33 3 32 + 33 6 

Reliable (unreliable) 34 33 34 33 33 34 

Humane (ruthless) 31 10 32 17 30 27 

Good-looking 32 4 30 10 30 15 

unattractive) 

Persistent (unstable 34 30 34 31 34 34 

Talkative (restrained 32 9 33 15 32 20 

Altruistic(self-centered), 31 4 19 13 20 14 

Imaginative (hard 29 6 25 13 27 22 
headed) 

Strong (weak) 34 28 32 25 30 26 

Honest (dishonest) 33 34 34 34 33 34 

Bold (timid 30 5 32 il 29 21 

Mature (immature) 34 10 33 30 32 32 

Relaxed (tense 32 6 27 9 24 10 

Decisive (indecisive) 34 29 32 31 31 34 

Broad-minded (narrow- | 33 5 31 19 31 21 
minded 

ire (insecure 33 10 29 i9 31 27 

Mean 32.35; 12.30) 30.75) 17.00; 30.25) 21.60 


TABLE 2 
SUMMARY OF ANALYSIS OF VARIANCE 


, Mean 
Source d Square PF 
Groups 2 130.44 
Conditions 1 | 6006.67 | 18.42 
Groups X Conditions 2} 326.10} 6.35* 
Within Groups 114 51.34 
*p< Ol 


if variance, a significant F either for Groups or for 
Groups X Conditions interaction would confirm 
this prediction. 


RESULTS 


Table 1 gives the number of Ss in each subgroup 
who checked the preferred traits, and Table 2 
gives a summary of the analysis of variance of these 
data. 

The significant F for the Groups X Conditions 
interaction confirms the prediction: the classifica- 
tion of the Ss by sociability was a source of real 
differences which were contingent upon the condi- 


tion under which Ss worked. Note, in Table 1, 
that the mean number of Ss attributing the pre- 
ferred traits under the Sociable condition remained 
virtually constant across the three groups (32.35, 
30.75, 30.25), while under the Unsociable condition 
the corresponding means rose from 12.30 in the 
High group, through 17.00 in the Middle group, to 
21.60 in the Low group. 

Neither Groups nor Conditions achieved signifi- 
cance independently. The F for Conditions, as 
tested against Interaction since the latter is signifi- 
cant (see Dixon & Massey, 1951, pp. 137-138, or 
Walker & Lev, 1953, p. 360), falls barely short of 
the value required for significance at the .05 level; 
in any event, Conditions as a primary variable was 
of no concern. 


DISCUSSION 


The hypothesis that the centrality of a trait 
varies with the strength of that trait in the per- 
ceiver receives qualified support with respect to 
the trait pair sociable-unsociable as tested in the 
present manner. 

The measured sociability of our Ss was signifi- 
cantly related to the impressions they formed, 
defined in terms of the frequency with which pre- 
ferred traits were attributed to the described per- 
son. This relationship was, however, confined to 
the situation in which the person was described as 
unsociable. Under this condition, Ss formed im- 
pressions that varied in the High, Middle, and Low 
Sociability groups: while Low Sociability Ss were 
relatively generous in assigning preferred traits to 
the “unsociable” person, the High Sociability Ss 
were more reluctant to do so, and the Middle 
Sociability Ss intermediate. 

The description of the hypothetical person as 
sociable produced in all three groups an impression 
so favorable that the sociability classification of 
the Ss was, so to speak, washed out. 

Whether this asymmetrical effect obtains in 
connection with traits other than _ sociable- 
unsociable remains for future research to deter- 
mine. 

The results of the present study, in indicating 
that one’s own sociability may influence the 
centrality of that trait in the forming of impres- 
sions of others, enhance the possibility that 
“person perception” may involve other relation- 
ships between the traits of the perceiver and those 
of the perceived regarding the saliency and weight 
of those traits in the resulting impressions. 
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THE PSYCHOLOGICAL MEANING OF ACQUIESCENCE SET: 


FURTHER EVIDENCE 


N. L. GAGE anp B. B. CHATTERJEE 


Bureau of Educational Research, University of Illinois 


re PAPER strengthens an earlier hypothesis 
(Gage, Leavitt, & Stone, 1957) by means of 
a more appropriate method and correspondingly 
more significant results. The hypothesis was that 
acquiescence set—the tendency to say Yes or 
Agree when in doubt—partakes of “authoritarian 
submission”’ or ‘“‘a submissive uncritical attitude 
toward idealized moral authorities of the ingroup” 
(Adorno, Frenkel-Brunswik, Levinson, & San- 
ford, 1950, pp. 230-232). 

From this hypothesis it was reasoned that either 
Agree or Disagree responses to positive items— 
items expressing nonauthoritarian beliefs or 
attitudes—would be equivocal in their significance. 
“Logically,” agreement with such items would 
betoken nonauthoritarianism. “Psychologically,” 
however, agreement with such items, especially 
when the statements were vague and ambiguous, 
might reflect the authoritarian submission trend. 
The same ambiguity would characterize Disagree 
responses to positive items. 

On the other hand, an Agree or Disagree re- 
sponse to a negative item—a statement expressing 
authoritarian beliefs or attitudes—would be un- 
equivocal in its significance for the personality of 
the respondent. Both “logically” and “psychologi- 
cally,” by making the basic assumption, Agree 
responses could be considered to indicate authori- 
tarianism, while Disagree responses would un- 
equivocally signify the opposite. 

In short, responses to positive items could mean 
one thing logically and the opposite psychologi- 


cally, while responses to negative items would 
mean the same thing both logically and psychologi- 
cally. Accordingly, it was predicted that negative 
items have greater validity than positive items for the 
measurement of authoritarianism. 

In the paper referred to, this hypothesis was 
tested not with the F scale, which contains only 
negative items, but with a similar instrument, the 
Minnesota Teacher Attitude Inventory (MTAI) 
(Cook, Leeds, & Callis, 1951). The MTAI con- 
tains 38 positive items and 112 negative items. As 
an external, highly reliable, and readily available 
but far from perfect criterion of authoritarianism 
against which to determine the validity of the 
MTAI items, the original study used pupils’ 
ratings of their teachers on a 48-item “My 
Teacher” inventory (Leeds & Cook, 1947, p. 154). 

In this original study, the hypothesis was sup- 
ported, but only weakiy. The validity of the posi- 
tive items, for 97 teachers of Grades 4—6, was .13; 
the three randomly made sets of negative items 
had a mean validity of .25. The difference between 
the two 7’s of .13 and .25 was reported as barely 
significant at the .05 level, using Hotelling’s ¢ test 
(1940). In a subsequent computation of the sig- 
nificance test, the mean (.69) of the three separate 
rm's between the positive set and the three sets of 
negative items was used, instead of the single r2 
(.73) between the positive set and the total set of 
112 negative items. This change in re; made the 
difference between ry. and 13 slightly less signifi- 
cant; i.e., as is shown for the Della Piana-Gage 
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TABLE 1 


STATISTICAL SIGNIFICANCE OF THE REPLICATIONS AND COMBINED RESULTS 


ria (Mean, 3 4 Pos. 
(Validity Neg. Item Items vs. 3 Neg. p. x? Equiv. 
Source of Data Pos. Items) Validities) Item Sets) berys-ry) af, (one- tail) of p* afys 
Della Piana-Gage aa .25 69 1.53 94 .07 5.3185 2 
Mazzitelli .18 .12 .62 — .78 123 75 .5754 2 
Cook-Hoyt .14 ae .62 1.04 64 .14 3.9322 2 
Leeds 24 .27 51 31 97 3? 1.9885 2 
Combined results 11.8146* 8 


data in Table 1, ¢ was now 1.53 instead of 1.6,4 
and the one-tailed » was now .07 instead of .05. 


THREE REPLICATIONS 


This method of testing the hypothesis had a 
major weakness: it employed positive and negative 
items that had been highly selected on the basis of 
empirical evidence of their validity in discriminat- 
ing between teachers on the criterion measure. 
Such empirical selection of items should tend to 
equalize the validity of the positive and negative 
items and make any difference in their validity 
hard to demonstrate. This limitation of method 
showed itself in three replications of the original 
study. 

Highly comparable sets of data were available 
from studies by Mazzitelli (1957), Cook and Hoyt? 
(1958), and Leeds (1946); these authors generously 
made their data available to us. The results of 
these replications, together with that of the original 
study, are shown in Table 1. As before, the number 
of positive items was 38, and the 112 negative 
items were divided at random into three sets of 38, 
38, and 36. As is shown in Table 1, three of the 
four studies yielded results in the direction of the 
hypothesis, but none of the obtained differences 
between ry. and ris reached the .05 significance 
level. When the chi square model for testing the 
significance of combined results was applied, p 
was .17 (Jones & Fiske, 1953). 

Clearly, it seems unlikely that a statistically 
significant difference in validity will be found 
between the positive and negative items of the 
published version of the MTAI. Our explanation 
of this is that the MTAI’s positive and negative 
items have been highly selected and hence to some 
extent equated, on the basis of their validity in 
differentiating among teachers on a variable sub- 
stantially like authoritarianism. 


‘W. W. Cook & C. J. Hoyt, Personal communica 
tion, 1958. 


A More APPROPRIATE METHOD 


Rather than reject the hypothesis on the basis 
of these results, we sought data more appropriate 
for a test of it. Such data proved to be available 
in Leeds’ (1946) doctoral dissertation. In the 
course of that study, Leeds wrote 754 items aimed 
at eliciting attitudes of teachers relevant to their 
ability to get along with their pupils on nonauthor- 
itarian terms. He divided these 754 items into 
two forms of a try-out instrument, Forms A and B. 
Each form was composed of approximately equal 
numbers of positive and negative items. Further- 
more, most of the positive items on Form A re- 
appeared in negative versions on Form B, and 
most of the negative items of Form A reappeared 
in positive versions on Form B. 

These two forms were administered by Leeds 
some weeks apart to 200 teachers of Grades 4-6. 
One hundred of these teachers had been identified 
by their principals as highly superior in getting 
along with pupils, in maintaining a good classroom 
atmosphere, and so on; the other 100 had been 
identified by their principals as extremely inferior 
in these same respects. 

Leeds made available distributions of the supe- 
rior and inferior teachers’ responses to 738 of the 
754 items. It was hence possible to compute a 
validity index for these items (Mazzitelli, 1957). 
This was done by assigning weights of 1-5 to each 
of the five response alternatives (agree strongly, 
agree, undecided, etc.) and obtaining two sums of 
these weights, one for the responses to a given 
item by the 100 superior teachers and the second 
for the responses by the 100 inferior teachers. The 
index of item validity was the absolute difference 
between the sum of the response weights for the 
100 superior teachers and that for 100 inferior 
teachers. 

Prior to this computation of validity indices, 
each item was classified as “positive” or “‘nega- 
tive,” i.e., as stating a nonauthoritarian or an 
authoritarian belief or attitude, by two graduate 
students in education who had been given brief 
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TABLE 2 
Vatipity INpices OF PosITIVE AND NEGATIVE 
Irems In Leeps’ Forms A anv B 


Form A Form B 


Validity Index Positive Negative 


Positive Negative 


Items Items Items items 
75-79 1 1 
70-74 
65-69 1 1 
60-64 
55-59 I 1 
50-54 3 4 1 
45-49 + 13 2 7 
35-39 6 i) 2 17 
30-34 5 22 8 15 
25-29 18 22 18 2% 
20-24 22 29 21 21 
15-19 30 22 16 20 
10-14 26 23 34 34 
5-9 19 21 22 30 
O-4 39 17 39 24 
No. of items 176 191 165 206 
TABLE 3 


Cur Square OF Jornt Distripotions or ITEM 
Vautipity INDICES AND Posit1vE—-NEGATIVE 
CHARACTER OF ITEM 


Form A Form B 
Pos Neg. Pos. Nes. 
Items Items Items Items 
Higher Validity 62 108 70 118 
Lower Validity 114 83 95 88 
x? = 16.75, x? = 8.12, 
p < .0O1; p< 01 


definitions of these categories. These two judges 
agreed in their classification of 707 or 94 per cent 
of the 754 items. The 47 items on which the judges 
disagreed were categorized by a third judge (Gage), 
without knowledge of their validity indices. 

Table 2 shows the frequency distributions of 
these validity indices for the positive and negative 
items of Forms A and B. Note that for both forms 
the central tendency of the negative items is obvi- 
ously higher than that of the positive items. 
Splitting the two distributions of validity indices 
for each form at approximately the median of the 
combined distribution for that form yields the 
2 X 2 tables shown in Table 3. Chi square is 16.75 
(p less than .001) for Form A and 8.12 (# less than 
.01) for Form B. 


SUMMARY AND CONCLUSIONS 


This paper provides further evidence on the 
hypothesis that “‘negative’’ items—those stating 
authoritarian beliefs or attitudes—are more valid 
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than “positive’’ items for measuring authoritarian- 
ism. First, it is shown that it is inappropriate to 
test this hypothesis with the highly selected posi- 
tive and negative items of the published version of 
the Minnesota Teacher Attitude Inventory. A 
test of the combined significance of four sets of 
results using the published MTAI failed to yield a 
probability low enough to cast doubt upon the null 
hypothesis. 

A more appropriate test of the hypothesis would 
compare the validity of positive and negative 
items not previously selected on the basis of their 
validity. Such a comparison was carried out using 
738 items originally written by Leeds in the process 
of constructing the MTAI. The criterion of item 
validity consisted of principals’ judgments of the 
effectiveness of teachers in maintaining a non- 
authoritarian classroom atmosphere. The results 
for both Forms A and B, each consisting of positive 
and negative items in approximately equal pro- 
portions, are consistent with the hypothesis. In 
both forms, the proportion of the negative items 
with validities above average is very significantly 
higher than that of the positive items. 

Further replication, employing the best avaii- 
able sets of positive and negative F scale items 
(e.g., Christie, Havel, & Seidenberg, 1958), is in 
order. The validity of the items should of course 
be estimated against one or more external criteria 
of authoritarianism. By this method, rather than 
by mere internal analyses of relationships between 
positive and negative items themselves, more con- 
clusive evidence can be obtained on the issue 
whether acquiescence can be identified in pari 
with authoritarianism. Such replication will shed 
light on whether the superior validity of negative 
items here found stems from the item-writing 
idiosyncrasies of one test author (Leeds) or whether 
it reflects a basic similarity in psychological mean- 
ing between acquiescence set and the authoritarian 
submission trend. 
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HEAD-BODY DIFFERENTIATIONS IN BODY IMAGE AND SKIN 
RESISTANCE LEVEL! 


SEYMOUR FISHER 


Baylor University College of Medicine and Houston State Psychiatric Institute 


REVIOUS STUDIES have shown that the defi- 
P niteness an individual ascribes to his body 
boundaries is correlated with the relative physio- 
logical reactivity of his body exterior vs. his body 
interior (Fisher & Cleveland, 1958). Similarly, it 
has been demonstrated that the size or strength 
attributed by the individual to his right vs. left 
body sides is correlated with the relative GSR 
reactivity of the two sides (Fisher, 1958, 1959). 
Such findings have been interpreted as supporting 
the theory that body image attitudes influence the 
reactivity of body areas. 

The present study was undertaken to investigate 
a new dimension of body image and its relationship 
to the reactivity of given body areas. This new 
body image dimension involves the character of 
the distinction between the head area and the 
non-head area. It was conjectured on the basis of 
previous observations (Fisher & Cleveland, 1958; 
Reich, 1949) that one of the basic problems in 
body image organization relates to the degree of 
importance to be assigned to the head vs. the body 
in the over-all body schema. It was therefore 
hypothesized that there are individual differences 
such that at one extreme the head is experienced 
as relatively large and the body area as relatively 
small and at the other extreme the opposite pat- 
tern prevails. Further, it was hypothesized that 
the greater the relative size attributed to one of 
these sites as contrasted to the other, the greater 
is its physiological reactivity with reference to the 
other. 


‘ Supported by USPH Career Research Investigator 
Grant (M 1578). 


METHOD 
Body Image Measure 


The measure of head—body differentiation was 
based on size estimates involving head width and 
body width. Each S was taken into a dark light- 
proof room and asked to stand 6 ft. from two 
vertical luminous rods which were on a base 40 in. 
from the floor. The rods were 23 in. in length and 
l¢ of an in. in width. They were initially set so as 
to be touching, without an intervening space be- 
tween them. Each S was told that there were two 
rods and that they would be gradually moved 
apart. S was to indicate when he judged the space 
between them to be equal to the difference between 
the width of his head and the width of his body. 
Obviously, the greater the space between the rods 
the smaller was the magnitude he assigned to the 
head in the head—body comparison. The index 
used was the average of three judgment trials. 

The precedent for using size judgments to de- 
termine the relative prominence of body areas in 
the body schema has already been established in 
previous studies (Fisher, 1958, 1959; Fisher & 
Abercrombie, 1958). It should be emphasized that 
such size judgments have been successful only 
when obtained in a setting which maximizes the 
projection of the autistic components of body size 
concepts. Projection was encouraged in the 
present study by obtaining judgments in a dark 
room where Ss could not use ordinary visual cues 
from their own bodies or those of other individuals 
as an anchoring frame of reference. It was en- 
couraged also by the fact that instead of requesting 
Ss simply to estimate the sizes of their heads or 
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bodies separately, they were asked to make a more 
complex decision which involved not only an 
estimate of the magnitude of each area, but also a 
simultaneous comparison of them. Preliminary 
observations had shown that Ss made much more 
unrealistic judgments under these comparison con- 
ditions than under conditions calling for separate 
estimates of head and body width 


Head Area-Body Area Reactivity 


Skin resistance measure of 
relative physiological reactivity of head region vs. 
non-head region. A Brush direct writing oscillo- 
graph was employed. A constant current supply of 
20 microamperes and a DC amplifier were used 
for measuring the voltage across the S. The record 
was calibrated directly in ohms. Separate balanced 
systems were utilized for the head and non-head 
measures. The area of recording from given sites 
was made equivalent by using pieces of tape with 
two holes, each of 4 in. diameter, punched into 
them. Two minutes of recording were secured 
simultaneously from any pair of head vs. non-head 


was used as the 


sites 

The non-head area recording was taken from 
the dorsal surface of the middle segment of the 
middle finger of the left hand. This arbitrary 
choice was based on such factors as ease of apply- 
ing electrodes and similarity of the skin in this 
area to facial skin. More central portions of the 
body were not evaluated because of the practical 
difficulties of dealing with regions which are 
covered by clothing in both sexes. 

Three separate recording points were designated 
for the head. Site 1 was 4 in. directly above the 
glabella. Site 2 was on the left cheek, 24% in. 
directly lateral to the nasal septum. Site 3 was in 
the middle of the chin, half way between its lowest 
point and the lower lip of the mouth. Three sites 
were used for the head because there were indi- 
vidual variations in facial contours which made 
site determination unreliable. The resistance read- 
ings were taken in turn from each of the head 
locations simultaneously with readings from the 
finger. 

An index descriptive of relative head—body 
reactivity was formulated that was equal to the 
ratio of body resistance to head resistance at that 
point in the recording where the maximum differ- 
ence between the two occurred. This index was 
developed on the basis of preliminary exploration 
which indicated that it correlated better with the 
head—body differentiation measure than did other 
resistance level representations. 

The test-retest reliability of the resistance index 
was evaluated by retesting 15 Ss. The intervals 
between test and retest ranged from one day to 
three months, with a median of two months. A rho 


AND NOTES 


of .87 between forehead—hand reactivity ratios was 
obtained. 
Subjects 

There were 62 Ss, 18 female and 44 male. Th« 
median age was 21, and the median educationa 
level was 15 years. Most of the Ss were recruited 
from sororities and fraternities and paid for their 
services. A small portion of the group included 
secretaries and laboratory technicians. 


RESULTS 


Scores for both the reactivity measures and th 
body image measures varied widely. The finger 
head reactivity ratios ranged from 40-to-1 to 
1-to-7. The median ratio was one in which the 
finger resistance was three times as great as the 
head resistance. 

Ss’ estimates of the difference in width between 
head and body varied from a low of 2 in. to a high 
of 17. The median was 6 in. 

Chi square indicated significant relationships in 
the predicted direction between the body image 
measure of head—body differentiation and the skin 
resistance ratio indices. As shown in Table 1, Ss 
above the median in degree of relative size at 
tributed to the head manifested significantly 
smaller ratios of head area resistance to non-head 
area resistance than did Ss below the median. This 
was true at the .001 level for the forehead—hand 
sites, between the .02 and .01 levels for the cheek 
hand sites, and at the .001 level for the chin-—hand 
sites. The clearest relationship between t! e body 
image measure and the reactivity ratio emerged 
when the reactivity ratio was computed on the 
basis of the forehead—-hand sites. Perhaps this was 
due to the fact that electrode placement is easier 
to standardize on the forehead than on the cheek 
or chin. 

It was noted that a larger proportion of the 
variance of the reactivity ratios contributing to 
the significant relation with the body image 


TABLE 1 
Cut Square ANALYSIS OF RELATIONSHIP OF 
Bopy IMAGE MEASURE TO VARIOUS 
Heap-Bopy Reactivity Ratios 


Variables Chi Square a 
Body Image vs. 
Forehead—Hand Ratio 9.2 .001 
Cheek—Hand Ratio 5.8 .02-.01 
Chin-Hand Ratio 8.4 .001 


* The direction of the results in each instance is such that 
the greater the size attributed to the head relative to the body, 
the lower is the resistance level of the head site as compared to 
the non-head site. 
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CRITIQUE AND NOTES 


measure was derived from the non-head skin re- 
sistance than from the head resistance. Thus, chi 
square indicated only a chance relationship be- 
tween forehead skin resistance alone and the body 
image index. However, the skin resistance of the 
hand alone was linked at the .001 level with the 
body image measure; thus, the greater the relative 
size ascribed to the head, the greater was the hand 
skin resistance. The question therefore arose of 
whether the non-head resistance measure alone 
might be the equivalent of the ratio index. A rank 
order correlation between the ratio index (fore- 
ead-hand) and the non-head resistance measure 
alone yielded a value of .69. Further, the ratio 
index (forehead-hand) was correlated (rho) .29 
01 level) with the body image measure; the 
analogous correlation for the non-head resistance 
measure was .34 (.01 level). It consequently 
appears that the ratio index and the non-head 
resistance measure are similar. They are not 
equivalent, however, and the ratio index seems to 
be slightly better correlated with the body image 
variable than the non-head resistance measure 
alone. For these reasons and because it makes more 
sense in theoretical terms, the ratio index was 
considered the most appropriate representation of 
physiological reactivity. 
DISCUSSION 

[he data support the hypothesis that different 
levels of reactivity in given body sectors are linked 
with corresponding differences in body image atti- 
tudes toward such sectors. Thus, when the head 
area was perceived as having large magnitude with 
reference to the body, it was characterized by 
relatively lower skin resistance (indicating greater 
reactivity) than that of the non-head area. But 
head area was judged as being small 
with reference to the body, it was found to have 
relatively higher skin resistance. These findings 
are congruent with previously reported correlations 
between body image boundary definiteness and 
body interior vs. body exterior reactivity and also 


when the 


between body image measures of right and left side 
body differentiation and right vs. left GSR reac- 
tivity. It is a speculative step to deduce from the 
significant relationships between body image and 
ody reactivity that body image attitudes toward 
a given body sector may play a causative role in 
the level of reactivity of that sector. The signifi- 
cant relationships could possibly be due to some 
third variable, or varying levels of reactivity in 
body areas may give rise to differential judgments 
it comparative size. 

The possibility that some third variable plays a 
major part in the observed relationships must 
continue to be seriously considered. However, the 
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idea that body image differentiations between body 
sectors simply grow out of sensation or reactivity 
differences between these sectors is strongly 
questioned by two previous findings. First, it has 
been shown (Fisher & Cleveland, 1958; Fisher & 
Fisher, in press) that the individual’s attitudes 
toward his body sectors (viz., inside vs. outside 
and right vs. left) are significantly linked with past 
socialization experiences as they are reflected in 
his images of the parental figures. It is difficult to 
see why such a relationship could exist if a body 
image attitude toward a localized body area were 
due simply to the reception of body sensations 
from that area. Second, it has been demonstrated 
(Fisher & Cleveland, 1958) that the mere experi- 
encing of intense sensations from a body area 
(e.g., persistent muscle pain), even for a period of 
years, does not significantly affect basic body 
image attitudes of the sort which have here been 
under consideration. 

It is becoming increasingly clear that there are 
not one or two important dimensions of body 
image but rather a complex array. There is also a 
corresponding array of differences in physiological 
reactivity between various body sectors. The task 
remains of determining the full range of important 
body image dimensions and how they are related 
to each other. 


SUMMARY 
This study concerned itself with testing the 
hypothesis that the relative physiological reac- 
tivity of the head area vs. the non-head area is 
linked with the relative size representation as- 
signed to each in the body schema. Size estimates 
concerning the head and body were obtained from 
62 Ss in an unstructured situation. Also, skin 
resistance levels were obtained from head and 
non-head sites simultaneously. The over-all results 
were clearly supportive of the hypotheses. 
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EXTRINSIC RELIGIOUS VALUES AND PREJUDICE 


W. CODY WILSON 


Harvard University 


GROWING body of empirical evidence in 


A that 
according to various 
prejudiced than those who are not “religious” 
(Adorno, Frenkel-Brunswik, Levinson, & San 
ford, 1950; Allport & Kramer, 1946; Levinson, 
1954; Pettigrew, 1956; Rosenblith, 1949; Stouffer, 
1955; Turbeville & Hyde, 1946) 

Adorno, et al. (1950 
analysis of some of their data, suggested that the 


dicates people 


criteria tend to be 


, as a result of qualitative 


fact of acceptance or rejection ol religion is not as 
important as the way which it is accepted or 
distinguished between a 
Li conventional, externalized, and 
neutralized and one that is more personal and 
internalized. Allport 1954) also 
motivation for religious affiliation, postulating 
that those who affiliate with a religious institution 


rejected (p. 217); they 


at is 


religion tl 


emphasized the 


for utilitarian, self-serving purposes—those with 
tend to be prejudiced. 


leads developed in 


“extrinsic” religious values 


Levinson, following the 


The Authoritarian Personality, devised a Religious 


Conventionalism Scale which records the extent 


of agreement or disagreement with statements of 


conventional religious beliefs about the church, 


the Bible, God, prayer, etc 
and the Califorr 


Correlations between 


this scale a Ethnocentrism Scale 


of the general magnitude of .50 were reported 


(Levinson, 1954) for samples heterogeneous in 
terms of religious affiliat 
This paper data concerning another 


scale of religious values, 


affiliated with a 


rep rts 


| designed for se 


with 
religious institution 
Adorno, et al. (1950) 


it with the 


individuals 
und reflecting the ideas of 
und Allport (1954), and « 
Religious Convention: 


mpares 
ilism Scale 


The Extrinst 


Religious Values Scale’ 
rhe instrument is composed of 15 dichotomous 
These items are of two general types: (a) 


one subgroup reflects 


tems.” 
in allegiance to, and de 
external or institutional 


the other subgroup 


pendence upon, the 
tructure of a church; and (6 
reflects a utilitarian orientation toward religion, 
i.e., acceptance of religion as a means. One of the 
alternative each item reflects 


two responses to 


‘The aid of Kay Wilson and G. W 
invaluable in the formulation of these items 

?The Extrinsic Religious Values Scale has 
deposited with the American Documentation In- 
stitute. Order Document No. 6135, remitting $1.25 for 
35-mm. microfilm or $1.25 for 6 by 8 in. photocopies 


Allport was 


been 
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who are “religious” 
more 


an “extrinsic religious value”’; the other alternative 
is not positively defined but simply reflects the 
absence of extrinsic religious value. The score 
on the instrument is the sum of the items for which 
the “extrinsic” alternative is preferred. 

Two examples of the items of the Extrinsic 
Religious Values scale (the asterisk indicates the 
alternative that reflects the extrinsic value) are: 
Prayer is, above all else, a means of obtaining 
needed benefits, protection, and safety in a 
dangerous world. 

* On the whole, I agree. 
On the whole, I disagree 
Some people say that they can be genuinely 
religious without being a member of any church. 
__ Yes, I agree that they can. 
* I do not believe this. 

The scale has a corrected equivalent-halves 
reliability of between .80 and .85 for homogeneous 
Protestant samples with Ns in the range from 15 
to 35. 

METHOD 

The Extrinsic Religious Values Scale (ERV), 
a 12-item version of Levinson’s Religious Con 
ventionalism Scale (RC), a 12-item version of the 
California Anti-Semitism Scale (AS), and filler 
items were assembled into a questionnaire with 
typical instructions. The total questionnaire 
contained approximately 50 items and required 
about 25 min. for 90% of people with a high 
school education to complete. The questionnaire 
was administered to the following small, relatively 
homogeneous groups at regularly scheduled or- 
ganizational meetings: 

1., 2., 3., 4. Adult 

Congregational Christian churches in the 

metropolitan Boston area (V = 15, 22, 22, 

and 35 respectively) ; 

5. Members of a young adult organization in a 

Methodist church in the Boston area (VN = 10); 

6. Harvard-Radcliffe students of various 

Christian denominations who reported attending 


members of four different 


services regularly (V = 26); 

7. Adult members of a Disciples of 
church in North Carolina (VN = 11); 

8. Adult members of the choir of a Negro Bap- 
tist church in the Boston area (V = 15); 
9. Sophomore students in a Catholic 
college in the Boston area (NV = 23); 

10. Catholic graduate students in the Boston 
area (NV = 238). 
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TABLE 1 
RELATIONSHIP BETWEEN EXTRINSIC RELIGIOUS VALUES, RELIGIOUS CONVENTIONALISM, AND 
ANTI-SEMITISM 
Group N hee ERV RC AS Correlations® 
Mean SD Mean SD Mean SD ERV-AS RC-AS | ERV-RC 
= = ‘ 
1 15 .80 7.0 3.0 68.1 9.6 48.0 21.8 << .39 -oi* 
2 22 .82 6.3 2.0 — 36.0 12.5 . oem —?b —-t 
3 22 72 7.3 2.0 64.5 10.1 44.3 11.9 — .08 08 
{ 4 35 .85 ta 2.6 60.5 9.5 47.7 15.9 -_- .29 .20 
5 10 81 5.2 1.8 -b 35.0 11.2 .68* -——» -—b 
6 26 85 5.0 1.9 52.5 17.7 33.2 12.7 — .41* 17 
7 11 86 7.9 2.3 —» 51.6 15.1 mF 3 —b —> 
; 8 15 82 9.2 , 69.0 8.4 41.0 10.8 .64** 31 
9 23 57 6.6 1.7 78.2 6.3 40.8 10.7 .41* . Boe. x 
10 28 51 6.1 1.7 75.9 5.5 36.0 9.1 .43* . ae aa 


Note.—The following abbreviations are used in the column headings: ERV, Extrinsic Religious Values Scale; RC, Religious Conven- 


tionalism Scale; and AS, Anti-Semitism Scale 


* Corrected equivalent halves. 


> The Religious Conventionalism Scale was not administered to this sample 


® All correlations are by rank-difference method 
“P< 0S 

"P< Ol 

see P< .001 


It should be noted that these groups, totaling 


207 subjects, cover a wide range of religious, 


ethnic, and geographic variation. 


RESULTS 


re 


The relation between the Extrinsic Religious 
Values Scale and Anti-Semitism Scale was 
positive and significantly greater than zero for 
each of the 10 samples as shown in Table 1 

The Extrinsic Religious Values Scale has a 
consistently larger relationship to the Apnti- 
Semitism Scale than the Religious Con- 

ntionalism Scale {columns 10 and 11, Table 1); 
the ERV-AS correlation is greater than the RC-AS 
correlation for each of the groups that 
responded to all three scales (P = .016, sign test). 


the 


dc cS 
seven 


DISCUSSION 


The reliability of the ERV scale, and conse- 
juently its relation to AS, is lower for the Catholic 
than for the Protestant samples 
columns 3 and 10, Table 1). This result 
tributable to the fact that approximately one- 
urth of the items in the ERV scale did not differ- 
entiate among the Catholics church 
ne dictates the responses to these items for 
lividuals affiliated with it. The ERV dimension 


groups (see 


is at- 
official 


ipparently may be measured, and it does correlate 
with AS, within the Catholic groups, but the scale 
needs to be revised for use with this population. 

The ERV-AS correlation, for the eight Prot- 
estant groups, ranged from .56 to .72 with a mean 


of .66. It may be expected that the actual or “‘true”’ 
relationship between extrinsic religious values 
and prejudice is considerably larger than these 
figures. If corrections for homogeneity and at- 
tenuation based on mean values in these areas are 
applied to the mean correlation between ERV 
and AS, the corrected coefficient is .80. Further, a 
qualitative inspection of the data indicated that 
the individuals who contril uted the most to the 
lowering of the relationship between ERV and AS 
were several who scored relatively high on the 
ERV scale but extremely low on the more direct 
AS scale. The reverse, that is, low ERV and high 
AS scores, did not occur nearly so frequently. 
It is likely that the validity of the ERV scale is 
greater than that of the AS scale because its 
content is more ambiguous in terms of social 
acceptability. The relationship between extrinsic 
religious values and prejudice is, therefore, perhaps 
greater than indicated by the relationship between 
the two scales. 

The Extrinsic Religious Values Scale appears 
to be twice as effective as the Religious Con- 
ventionalism Scale in terms of predicting, or ac- 
accounting for variance in, prejudice within Prot- 
estant groups. If the correlations between ERV 
and AS, and RC and AS (Columns 10 and 11, 
Table 1), are transformed into V/1 — rf or r’, 
then the value for ERV-AS is more than twice 
that for RC-AS in all five of the Protestant groups 
that responded to all three of the scales (P = .031, 
one-tailed sign test). 
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It may be noted that the RC-AS correlations 
reported in this study are smaller than the correla- 
tions between RC and prejudice previously re- 
ported in the literature. This study used samples 
rather homogeneous with respect to religion and a 
where previous 

hete 


with respect to religion and a generalized prejudice 


specific measure of prejudice, 


studies typically used samples ogeneous 
scale 

surprising finding of this 
study, in view of the fact that both the ERV and 
RC scales were based in part on the insights 
developed in The Personality, is 
the low relationship between the ERV and the 
RC Table 1). On the other 
hand, as the name the Religious Con- 
ventionalism Scale concentrates upon the content 
the 


atte mpts to 


Perhaps the most 


{ uthoritarian 


scales (column 12, 


implies, 


of religious beliefs whereas Extrinsic Re 


ligious Values Scak measure more 


directly the motivations for affiliating with a 


religious institution 
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THE KNOWLEDGEABLE NEWCOMER UNDER CONDITIONS 


OF GROUP SUCCESS AND FAILURE! 


ROBERT C 


ZILLER anp RICHARD D 


BEHRINGER? 


University of Delaware 


HETHER members of a group perceive a 
W newcomer as facilitating or obstructing 
may be determined, in part at least, by the needs 
of the group (Bruner & Postman, 1949). It may 
be assumed that members of successful groups in 
ose of groups that have experienced 
failure group as a self-sufficient, 
self-contained, and complete social unit. However, 


contrast to tl 
perceive their 


the introduction of a new member may precipitate 
examination of and changes in the organizational 
structure, the methods of sharing group rewards, 
and the group’s modes of adjustment to its en 
vironment. Thus, to a successful group, the new 
comer represents a potentially disruptive force 
threatening to stimulate changes in positively 


reinforced behavior. Unsuccessful groups, on the 


' This report is an extension of a paper read before 
the American Psychological Association meetings in 
Cincinnati, Ohio, 1959 

? Most of the statistical analyses were conducted by 
Paula Tammany 


other hand, may be expected to perceive the new 
comer as a possible resource and as an agent or 
catalyst of change. Thus, it was hypothesized 
that newcomers are accepted less readily by suc 
cessful groups than by groups with a history of 
failure. 
METHOD 

Subjects 

The subjects (Ss) were male and female volun 
teers from freshman English classes at the Uni- 
versity of Delaware. The 120 Ss were assembled at 
random in 40 three-person groups homogeneous 
in sex. 


Procedure 

The experiment was described to the Ss as a 
study of group decision-making. The three-person 
host groups were required initially to solve two 
problems, a Meier Art Judgment problem and a 
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t-estimate problem, through the medium of a 
standard note-passing communication net of the 
sort used by Bavelas. Following an induction of 
group success or failure a new member was in- 
troduced to the group on the pretext of studying 
e effects of group size on decision-making. At this 
nt, the new group was presented with a second 
t-estimate task that was very much more diff- 

than the first. The newcomer was, in fact, a 
nfederate who knew the correct answer and a 
correct method for arriving at the answer. The 
criteria according to which assimilation of the 
newcomer was evaluated included the degree to 
which the group was persuaded to accept the 
correct answer, the degree to which the members 

lt that they had changed their initial estimates, 
three ratings of the newcomer by the host group 

smbers, and the number of notes addressed to 


> newcomer 
The inductions of success or failure followed 
the two initial tasks. Failure-groups were informed 
that their decision on the Art Judgment task was 
wrong and that on the dot-estimate task the group 
performed “only better than 10% of the groups 
who have participated in this task in the past . 
that is, nine out of ten groups that we have pre- 
yusly tested performed better than your 
roup....” Similarly, a “successful” group was 
formed that their decision on the art task was 
rrect and that the group performed better than 
X)% of the groups on the dot task. 
The dot display used for the third and critical 
problem presented ten times as many dots as 
the first task, contained in an irregular figure 
resembling a ping-pong paddle rather than the 
rectangular figure previously employed. The slide 
us exposed for only five sec. First, the members 
ibmitted individual judgments as to the correct 
imber of dots. Next, the group was permitted 
minutes in which to reach a group de- 
dots on the 
submitted 


fifteen 
number of 
each 


sion as to the correct 
card.” Finally, the members 
private estimates of the “number of dots they 
personally really thought there were on the slide.” 

rhe (the newcomers) received 
letailed about their Their 
members laconically com- 
and a correct 


confederates 

instructions 
nitial message to aill 
unicated the correct 
rithmetic process for arriving at it. Thereafter, 


roles. 
“estimate’”’ 


e confederates merely responded to inquiries 


vith prescribed messages reflecting profound 


confidence in their estimate but willingness to 
compromise for the sake of group efficiency and 
armony. Moreover, they were enjoined not to 
submit the final group estimate. It should be 
emphasized that the task was of such complexity 


that the naive Ss could neither estimate the correct 
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number of dots with confidence nor discover the 
correct approach. 

The method used by the confederates required 
them to focus on the rectangle of minimum di- 
mensions which enclosed the paddle-shaped dot 
display. The paddle-shaped figure covered ap- 
proximately one-half the area within the rectangle. 
Thus, the number of dots within the figure could 
be approximated by estimating the width and 
length of the rectangle in dots and dividing the 
product by two. 


Measures 


The dependent variables presumed to reflect 
degree of assimilation included measures of com- 
munication direction, group influence, and the 
relative ratings of the newcomer by the host- 
group members (Ziller, Behringer, & Jansen, 1959). 

The measure of communication direction was 
simply the total number of messages sent by the 
naive Ss to the newcomer. 

The first measure of group influence was the 
absolute difference between the group decision and 
the correct answer. The second measure of group 
influence was derived from the ratio of the degree 
of change in estimate in the direction of the correct 
answer to the degree of possible change. An upper 
limit of 1.00 is inherent in the index and a lower 
limit of zero was imposed. The resulting ratio was 
subjected to the arc sine transformation. The 
mean of the members’ transformed indices pro- 
vided the index of estimate change for a group. 

A phenomenological measure of influence was 
derived from the mean of the weighted responses 
of the naive members to the item: “I changed my 
individual estimate of the number of dots a great 
deal as a result of the group discussion.” The 
alternatives were arranged on a six-point scale 
varying from “agree very much” to “disagree 
very much.” 

Finally, the naive members rated themselves as 
well as the other group members (including the 
confederate) on nine-point descriptive scales 
regarding the statements: (a) This person’s ideas 
with reference to the dot-estimate task were of 
the very highest quality. (6) This person is ex- 
tremely pleasant and contributed a great deal to 
making this session enjoyable. (c) This person 
exerted great influence throughout the 
meeting. 


a very 


RESULTS 


The results regarding the newcomer’s influence 
(assimilation) are summarized in the first three 
rows of Table 1. In the analyses involving “group 
decision error” and “estimate change” (Rows 
1 and 2), a 2 X 2 factorial design was employed 








290 


CRITIQUE AND NOTES 


TABLE 1 
MEASURES OF INFLUENCE AND ACCEPTANCE OF NEWCOMER UNDER CONDITIONS OF GROUP 


SUCCESS AND FAILURE 


Success 
Group decision error 269.6 
Change in estimate 42.9 
Reported change 4.2 


Recipient 


New- |Mem New- Mem- 

comer) ber |Total comer; ber 

N (M N) (M 

Ratings of ‘‘ideas”’ 6.3 7.1 6.7 7.2 6.1 
Ratings of ‘“‘pleasantness”’ 6.8 7.6 7.2 7.4 7.7 
Ratings of “‘influence”’ 7.0 7.3 7.1 8.1 6.4 
Percentage of notes sent 34.4 | 37.8 | 36.1 | 33.3 | 36.6 


Note 
transformation 


including sex as a main effect. The decisions sub- 
mitted by the failure groups tended to be more 
accurate (more in agreement with the newcomer) 
than the decisions of successful groups; and the 
failure groups tended to change their estimates 
more in accordance with the newcomer’s estimate. 
However, only the latter results were statistically 
significant (p < .05). 

Regarding the phenomenological measure of 
influence (Row 3 the initial 
estimates were of varying degrees of accuracy, it 
was expedient to control for this 
variation by classifying the members of each group 
according to the relative accuracy of their initial 
estimates. The analysis of variance employed 
follows a technique described by Henderson (1959) 
for (analyses 
involving a comparison of individuals within the 
same group) and factorial classifications. Members 


since members’ 


considered 


combined nested classifications 


of successful groups reported less change of opinion 
than members of failure-groups (p about .0S). 
Moreover, the member whose initial estimate was 
least accurate tended to report greater change 
(p < .05). Thus, it may be presumed that the 
perceived change was in the direction of the new- 
comer’s position 

In regard to the rating 
and a regular group member were compared under 
all experimental conditions. The regular member 
for group was the 
whose nearly 


items, the newcomer 


selected comparison in each 


person initial estimate was most 
accurate or similar to the newcomer’s estimate 
rhus, the ratings by the members medium and 
low in accuracy of the newcomer and most ac- 
curate naive member were analyzed using the 


same complex analysis of variance technique. 


The measures involving a percentage, ‘‘change in estimate 


Failure Significance Tests 
99.2 
58.0 (p < .05) 
4.9 p about .05) 


Recipient 


New- Mem- 
Total comer) ber 
Total Total 
6.6 6.7 0.6 N X M, ? < .05 
7.5 7.1 7.6 | (N vs M, ~ < .05) 
7.3 7.6| 6.8) (NX M, p < .05) (N vs M, p < .05 
34.9 | 33.9 | 37.2 | (N vs M, p < .05) 


’ and “percentage of notes sent,” were converted by an arc sine 


(The results were not appreciably changed when 
ratings of either of the naive members medium or 
low in accuracy were used as controls rather than 
those of the most accurate naive member.) In 
teraction between success-failure and newcomer 
member status was significant (Row 4, p < 05 
Under the success condition, the regular member 
was rated higher than the knowledgeable new 
comer with reference to quality of ideas; but under 
the failure condition, the newcomer was rated 
higher than the regular member. Similar results 
were obtained concerning ratings of pleasantness 
(Row 5) and influence (Row 6); but only the latter 
results were statistically significant (pb < 05 
Moreover, the regular member was rated higher 
than the newcomer (f < 05) with regard t 
“pleasantness” but with regard to “influence’’ 
the newcomer was rated higher than the regular 
member (p < .05). 

Finally, the regular member received more 
notes than the newcomer from the remaining tw 
naive members (Row 7, p < .05). 


DISCUSSION 


The results appear to include three major 
findings. First, the knowledgeable 
tended to be more effective or influencial in 
failure groups than in the successful groups. More- 
over, the newcomer tended to be rated higher 
than matched regular members under conditions 
of failure; but under success conditions the results 
Finally, in general, the matched 


newcomer 


he 
Lit 


were reversed 
regular member tended to be rated higher than the 
knowledgeable newcomer. 

Clearly, the first two findings are mutuail 
supporting and are in accord with the initial 
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hypothesis: the newcomer is accepted to a greater 
degree by groups with a history of failure than by 
the successful groups. In accordance with our 
initial assumption, members of failure groups may 
perceive the newcomer as 2 possible resource and 
a possible stimulus to needed change, while suc- 
cessful groups may tend to perceive him as a 
restriction or supernumerary and a stimulus to 
potentially disruptive change. Since degree of 
assimilation was inferred from indices of influence, 
e results may indicate greater conformity in 
low prestige groups. Interpreted in these terms, 
the results of the present experiment are supported 
by an earlier experiment reporting greater con- 
rmity to a partner after previous failure on a 





f 


task (Mausner & Bloch, 1957). 

{ recent experiment by Mills (1957) may be 
nterpreted as supporting in part the former in- 
rpretation of the present findings. Mills found 
that when original members of the host group 

accepted one another, the value of the newcomer 

ninished. If members of successful groups tend 
to accept each other more than do members of 
failure groups (Deutsch, 1959), the findings of the 
two studies are consistent. 

We also find that the newcomer with the correct 
answer was regarded as less pleasant, but more 
nfluential than the original members, regardless 
f their relative accuracy. Moreover, relatively 

ver notes were addressed to the newcomer. 
These results may be attributed, in part, to the 
prescribed role of the newcomer. While the new 
comer was quick to offer “to go along with the 
group,” still, a degree of righteousness was in- 

erent in the role. It may be conjectured that the 
members reacted negatively to this in- 
ngruent, newcomer role component. Variations 
1e roles of the newcomers are suggested for 


ther research. 


SUMMARY 


was proposed that a new member may be 
perceived as either a resource or as an unwelcome 
source of disruption, depending upon the needs 


of the group. Moreover, it was assumed that 

members of a group with a history of failure per- 

ceive it as incomplete and wanting in resources. 

Thus, it was hypothesized that a newcomer is 

accepted more readily by a group with a history 

of failure than by a successful group. The experi- 
ment tested this hypothesis for the special case of 

a newcomer equipped with the solution to a 

problem otherwise too difficult for the group to 

solve accurately. The experiment tested this hy- 
pothesis for the special case of a newcomer 
equipped with the solution to a problem otherwise 
too difficult for the group to solve accurately. 

The experiment was conducted in a laboratory, 

note-passing setting and involved a newcomer 

who was, in fact, a confederate with the correct 
answer to the critical task and a method of arriving 
at that answer. Groups with a failure induction 
were persuaded by the newcomer to a greater 
extent than groups with a failure induction, and 
rated the newcomer higher than the regular group 
members. 
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